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On December 30, 2004, the Examiner made a final rejection to pending Claims 33, 38-40 
and 44-54. Claims 34 and 35 were canceled in the Amendment and Response to the final Office 
Action submitted February 28, 2005 . A Notice of Appeal was filed on May 27, 2005 . 

Appellants hereby appeal to the Board of Patent Appeals and Interferences from the last 
decision of the Examiner. A request for a 1 month extension of time is filed concurrently 
herewith. ' 

The following constitutes Appellants' Brief on Appeal. 
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1. REAL PARTY IN INTEREST 

The real party in interest is Genentech, Inc., South San Francisco, CaUfomia, by an 
assignment of the patent appHcation U.S. Serial No. 09/946,374 recorded January 8, 2002, at 
Reel 012288 and Frame 0504. 

2. RELATED APPEALS AND INTERFERENCES 

There are no related appeals or interferences known to Appellants, Appellants' legal 
representative, or Appellants' assignee that will directly affect or be directly affected by or have a 
bearing on the Board's decision in the present appeal. 

3. STATUS OF CLAIMS 

Claims 33, 38-40 and 44-54 are in this application. 
Claims 1-32, 34-37 and 41-43 are canceled. 

Claims 33, 38-40 and 44-54 stand rejected and Appellants appeal the rejection of these 

claims. 

A copy of the rejected claims involved in the present Appeal is provided in the Claims 
Appendix. 

4. STATUS OF AMENDMENTS 

An amendment to the claims was submitted with Appellants' Response to Final Office 
Action filed February 28, 2005. In the Advisory Action mailed August 1 1, 2005, the Examiner 
indicated that these amendments would be entered for the purpose of the appeal. Accordingly, 
the claims listed in the Claims Appendix incorporate the amendment of February 28, 2005. 

5. SUMMARY OF CLAIMED SUBJECT MATTER 

The invention claimed in the present application is related to an isolated nucleic acid 
comprising the nucleic acid sequence of SEQ ID NO:337, the full-length coding sequence of the 
nucleic acid sequence of SEQ ID NO:337, or the full-length coding sequence of the cDNA 
deposited under ATCC accession number 203322 (Claims 33, 38, 39, and 40), a vector 
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comprising the nucleic acid (Claims 44 and 45), and a host cell comprising the nucleic acid 
(Claims 46 and 47). The claims are further directed to an isolated nucleic acid molecule at least 
20, 50, 60, 70, 80, 90, or 100 nucleotides in length that hybridizes under stringent conditions to 
. the nucleic acid sequence of SEQ ID NO:337 or a complement thereof, or to the full-length 
coding sequence of the cDNA deposited under ATCC accession number 203322 or a 
complement thereof (Claims 46-54). 

The full-length PR01555 polypeptide having the amino acid sequence of SEQ K) 
NO:338 is described in the specification at page 29, lines 13-17, page 350, lines 27-30, in Figure 
198 and in SEQ ID NO:338. The cDNA nucleic acid encoding PR01555 is described in the 
specification in Example 102, in Figure 197 and in SEQ ID NO:337. Page 297, lines 7-11 of the 
specification provides the description for Figures 197 and 198. Fragments of at least about 20, 
50, 60, 70, 80, 90, or 100 nucleotides in length of nucleic acids encoding PRO are described at, 
for example, page 282, lines 12-19. 

The isolation of cDNA clones encoding PR01555 of SEQ ID NO:338 is described in 
Example 102. Methods for isolating PRO cDNA is generally set forth in the specification at, for 
example page 359, lines 1 1-34. Methods for selection and transformation of host cells with PRO 
cDNA is generally set forth in the specification at, for example, page 359, line 36, to page 361, 
line 24. Methods for selecting a vector are generally set forth in the specification at, for example, 
page 361 , line 26, to page 363, line 25. The use of polynucleotides encoding PRO as 
hybridization probes is described at, for example, page 364, lines 25-38, and page 481, hnes 1- 
11. Stringent conditions are defined at, for example, page 308, line 38, to page 309, line 7. 
Finally, Example 143, in the specification at page 494, line 20, to page 508, line 28, sets forth a 
Gene AmpUfication assay which shows that the PR01555 gene is amplified in the genome of 
certain human lung and colon cancers (page 507, lines 23-34, and Table 8). 
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6. GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL 

L Whether Claims 33, 38-40 and 44-54 satisfy the utility requirement of 35 U.S.C. 

§101. 

n. Whether Claims 33, 38-40 and 44-54 satisfy the enablement requirement of 35 
U.S.C. §112, first paragraph. 

7. ARGUMENT 
Summary of the Arguments: 

Issue I: Utility 

Patentable utiHty of the claimed PR01555 nucleic acid molecules is based upon the gene 
amplification data for the gene encoding the PRO 1555 polypeptide. The specification discloses 
that the gene encoding PRO 1555 showed significant ampUfication, ranging from 2 to 32 fold , in 
24 lung, colon, breast, kidney and testis tumors. The Declaration of Dr. Audrey Goddard, 
submitted with Appellants' Response filed September 16, 2004, explains that a gene identified as 
being amplified at least 2-fold by the disclosed gene amplification assay in a tumor sample 
relative to a normal sample is useful as a marker for the diagnosis of cancen for monitoring 
cancer development and/or for measuring the efficacy of cancer therapy. Accordingly, the 
Examiner's assertion that the specification "merely demonstrates that the PRO 1555 nucleic acid 
was amplified in some lung or colon cancers, to a minor degree (about 2.5 fold)." (Page 7 of the 
Office Action mailed December 30, 2004, emphasis added) is both factually and scientifically 
incorrect.. By referring to the 2-fold to 32-fold ampUfication of the PR01555 gene in lung, 
colon, breast, kidney and testis tumors as "very small," or "minor," the Examiner ignores the 
teachings of an expert declaration without any basis, or without presenting any evidence to the 
contrary . 

The Examiner has asserted that "[ajssays measuring DNA copy number found positive 
staining in about half the samples of several cancers, as well as about half the control samples. 
This inconsistent staining is not useful for diagnosing cancer, since a positive result does not 
indicate the presence of a cancer and a negative result does not indicate a lack of cancer." (Page 
1 of the Advisory Action mailed August 11, 2005). 
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Part of this statement is factually incorrect, since the specification does not disclose that the 
PR01555 gene was amplified in any control samples. As to the rest of the statement, as any 
skilled artisan in the field of oncology would easily appreciate, not all tumo r markers are 
generally associated with every tumon or even with most tumors . Therefore, whether the 
PR01555 gene is amplified in 24 lung, colon, breast, kidney and testis tumors all lung, colon, 
breast, kidney and testis tumors is not relevant to its identification as a tumor marker, or its 
patentable utility. Rather, the fact that the amplification data for PR01555 is considered 
significant is what lends support to its usefiilness as a tumor marker. If the goal is to diagnose 
lung, colon, breast, kidney or testis cancer, then contrary to the Examiner's assertion, a positive 
result does indicate the presence of cancer, while a negative result is not conclusive, and requires 
follow up testing. 

The Examiner has additionally asserted that that the gene amplification data discussed 
above does not provide utility for the claimed PR01555 nucleic acids because "there is no 
evidence regarding whether or not PR01555 mRNA or protein levels are also increased in 
cancer." (Page 5 of the Office Action mailed December 30, 2004). The Examiner has cited 
references by Pennica et al and Haynes et al. as evidence that an increase in gene copy number 
does not necessarily result in increased gene expression and protein expression. 

First of all, the claims are directed to nucleic acids, not polypeptides, therefore, the issue 
of whether there is a correlation between gene amplification and polypeptide expression levels is 
irrelevant . Similarly irrelevant is whether or not PRO! 555 mRNA levels are also increased. One 
of skill in the art would understand how to use the claimed nucleic acids to detect amplification 
of the gene encoding PR01555, and how to use the gene amplification results to diagnose cancer. 
Thus the question of whether or not PR01555 mRNA or polypeptide levels are also increased in 
these cancers has no relevance to the utility of the claimed nucleic acid molecules. 

Finally, the Examiner has cited Hu etal., in support of the assertion that "the literature 
cautions researchers fi-om drawing conclusions based on small changes in transcript levels." 
(Page 5 of the Office Action mailed December 30, 2004). Appellants submit that the evidentiary 
standard to be used throughout ex parte examination in setting forth a rejection is a 
preponderance of the totality of the evidence under consideration. 
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Thus, to overcome the presumption of truth that an assertion of utiUty by the applicant enjoys, the 
Examiner must estabhsh that it is more Hkelv than not that one of ordinary skill in the art would 
doubt the truth of the statement of utility. Only after the Examiner has made a proper prima facie 
showing of lack of utility, does the burden of rebuttal shift to the applicant. Hu et al. does not 
suffice to.show that a lack of correlation between gene amplification data and the biological 
significance of cancer genes is typical 

Accordingly, Appellants submit that when the proper legal standard is applied, one 
should reach the conclusion that the present application discloses at least one patentable utility 
for the claimed nucleic acids that encode the PR01555 polypeptide. 

Issue II: Enablement 

Claims 33, 38-40 and 44-54 stand rejected under 35 U.S.C. §1 12, first paragraph, allegedly 
"since the claimed invention is not supported by either a specific and substantial asserted utility or a 
well established utility ... one skilled in the art clearly would not know how to use the claimed 
invention." (Page 4 of the Office Action mailed December 30, 2004). 

Appellants submit that, as discussed above, the nucleic acids that encode the PR01555 
polypeptide, or firagments thereof, have utility in the diagnosis of cancer. Based on such a utility, 
one of skill in the art would know exactly how to use the claimed nucleic acids for the diagnosis 
of cancer, without any undue experimentation. 

These arguments are all discussed in fiirther detail below xmder the appropriate headings. 

ISSUE I: Claims 33, 38-40 and 44-54 satisfy the utility requirement of 35 USC §101 

Claims 33, 38-40 and 44-54 stand rejected under 35 U.S.C. §101 because allegedly "the 
claimed invention lacks a credible, specific and substantial asserted utiUty or a well established 
utility." (Page 3 of the Office Action mailed March 1 8, 2004). 

Appellants submit, for the reasons set forth below, that the specification discloses at least 
one credible, substantial and specific asserted utility for the claimed nucleic acids encoding the 
PR01555 polypeptide. 
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A. The Legal Standard for Utility 

According to 35 U.S.C. § 101: 

Whoever invents or discovers any new and useful process, machine, manufacture, or 
composition of matter, or any new and useful improvement thereof, may obtain a 
patent therefor, subject to the conditions and requirements of this title. (Emphasis 
added.) 

In interpreting the utility requirement, in Brenner v. Manson^ the Supreme Court held that 
the quid pro quo contemplated by the U.S. Constitution between the public interest and the 
interest of the inventors required that a patent apphcant disclose a "substantial utiUty" for his or 
her invention, i.e. a utility "where specific benefit exists in currently available form."^ The Court 
concluded that "a patent is not a hunting license. It is not a reward for the search, but 
compensation for its successful conclusion. A patent system must be related to the world of 
commerce rather than the realm of philosophy."*^ 

Later, in Nelson v. Bowler the C.C.P.A. acknowledged that tests evidencing 
pharmacological activity of a compound may establish practical utility, even though they may not 
establish a specific therapeutic use. The court held that "since it is crucial to provide researchers 
with an incentive to disclose pharmaceutical activities in as many compounds as possible, we 
conclude adequate proof of any such activity constitutes a showing of practical utility."^ 

In Cross v. lizuka^ the C. A.F.C. reaffirmed Nelson, and added that in vitro results might 
be sufficient to support practical utility, explaining that "zn vitro testing, in general, is relatively 
less complex, less time consuming, and less expensive than in vivo testing.Moreover, in vitro 
results with the particular pharmacological activity are generally predictive of vivo test results, 



. ^ Brenner v. Manson, 383 U.S. 519, 148 U.S.P.Q. (BNA) 689 (1966). 

^ Id, at 534, 148 U.S.P.Q. (BNA) at 695. 
' Id. at 536, 148 U.S.P.Q. (BNA) at 696. 

' Nelson v. Bowler, 626 F.2d 853, 206 U.S.P.Q. (BNA) 881 (C.C.P.A. 1980). 
^ Id, at 856, 206 U.S.P.Q. (BNA) at 883. 

' Cross V. lizuka, 753 F.2d .1047, 224 U.S.P.Q. (BNA) 739 (Fed. Cir. 1985). 
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i.e. there is a reasonable correlation there between."^ The court perceived "No insurmountable 
difficulty" in finding that, under appropriate circumstances, "in vitro testing, may establish a 
practical utility."^ 

The case law has also clearly established that applicants' statements of utility are usually 
sufficient, unless such statement of utility is unbelievable on its face.^ The PTO has the initial 
burden to prove that applicants' claims of usefuhiess are not behevable on their face.^^ hi 
general, an Applicant's assertion of utility creates a presumption of utility that will be sufficient 
to satisfy the utility requirement of 35 U.S.C. § 101, "unless there is a reason for one skilled in the 
art to question the objective truth of the statement of utihty or its scope." , 

Compliance with 35 U.S.C. §101 is a question of fact.'^ The evidentiary standard to be 
used throughout ex parte examination in setting forth a rejection is a preponderance of the 
totality of the evidence under consideration.'^ Thus, to overcome the presumption of truth that 
an assertion of utility by the applicant enjoys, the Examiner must establish that it is more likely 
than not that one of ordinary skill in the art would doubt the truth of the statement of utility. 
Only after the Examiner made a proper prima facie showing of lack of utility, does the burden of 
rebuttal shift to the appUcant. The issue will then be decided on the totality of evidence. 

The well established case law is clearly reflected in the Utility Examination Guidelines 
("Utility Guidelines")'^, which acknowledge that an invention complies with the utility 

' Id. at 1050, 224 U.S.P.Q. (BNA) at 747. 

" Id.. 

' In re.Gazave, 379 F.2d 973, 154 U.S.P.Q. (BNA) 92 (C.C.P.A. 1967). 
. "» Ibid. 

" In re Langer, 503 F.2d 1380,1391, 183 U.S.P.Q. (BNA) 288, 297 (C.C.P.A. 1974). 

See also In reJolles, 628 F.2d 1322, 206 U.S.P.Q. 885 (C.C.P.A. 1980); In re Irons, 340 F.2d 974, 144 
U.S.P.Q. 351 (1965); In reSichert, 566 F.2d 1154, 1159, 196 U.S.P.Q. 209, 212-13 (C.C.P.A. 1977). 

Raytheon v. Roper, 724 F.2d 951, 956, 220 U.S.P.Q. (BNA) 592, 596 (Fed. Cir. 1983) cert, denied, 469 
US 835 (1984). 

In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d (BNA) 1443, 1444 (Fed. Cir. 1992). 
66 Fed. Reg. 1092(2001). 
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requirement of 35 U.S.C. §101, if it has at least one asserted "specific, substantial, and credible 
utility" or a "well-established utility." Under the Utility Guidelines, a utility is "specific" when it 
is particular to the subject matter claimed. For example, it is generally not enough to state that a 
nucleic acid is useful as a diagnostic without also identifying the conditions that are to be 
diagnosed. 

In explaining the "substantial utility" standard, M.P.E.P. §2107.01 cautions, however, 
that Office personnel must be carefiil not to interpret the phrase "immediate benefit to the pubUc" 
or similar formulations used in certain court decisions to mean that products or services based on 
the claimed invention must be "currently available" to the public in order to satisfy the utility 
requirement. "Rather, any reasonable use that an appUcant has identified for the invention that 
can be viewed as providing a public benefit should be accepted as sufficient, at least with regard 
to defining a 'substantial' utility."^^ Lideed, the Guidelines for Examination of AppUcations for 
Compliance With the Utility Requirement,^^ gives the following instruction to patent examiners: 
"If the applicant has asserted that the claimed invention is useful for any particular practical 
purpose . . . and the assertion would be considered credible by a person of ordinary skill in the 
art, do not impose a rejection based on lack of utihty." 

B. The Data and Documentary Evidence Supporting a Patentable Utility 

Appellants respectfully submit that Appellants rely on the gene amplification data for 
patentable utihty of the claimed nucleic acids encoding the PR01555 polypeptide, and that the 
gene ampUfication data for the gene encoding the PR01555 polypeptide is clearly disclosed in 
the instant specification under Example 143. 

It was well known in the art at the time the invention was made that gene amplification is 
an essential mechanism for oncogene activation. The gene amplification assay is well-described 
in Example 143 of the present application. Example 143 discloses that the inventors isolated 
genomic DNA from a variety of primary cancers and cancer cell lines that are listed in Table 8, 
including primary lung and colon tumors of the type and stage indicated in Table 7. 

'Vm.P.E.P. §2107.01. 
M.P.E.P. §2107 n (B)(1). 
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As a negative control, DNA was isolated from the cells of ten normal healthy individuals, which 
was pooled and used as a control. Gene amplification was monitored using real-time quantitative 
TaqMan^^ PGR. Table 8 shows the resulting gene amplification data. Further, Example 143 
explains that the results of TaqMan^^ PGR are reported in AGt units, wherein one unit 
corresponds to one PGR cycle or approximately a 2-fold amplification relative to control, two 
units correspond to 4-fold amplification, 3 units to 8-fold amplification etc. 

Appellants respectfiilly submit that a ACt value of at least 1.0 was observed for PR01555 
in at least 24 of the tumors and tumor cell lines listed in Table 8. The nucleic acids encoding 
PR01555 had AGt value of > 1.0 (1) in primary lung tumors: LT13, LT15, LT16, HF-000631, 
HF-000840, and HF-000842; (2) in lung carcinoma cell lines: A549, Galu-1, Galu-6, H441, 
H460, and SKMESl; (3) in primary colon tumors: GT15, GT16, CT17, and colon tumor centers 
HF-000539 and HF-000575; (4) in colon carcinoma cell lines: SW620, Golo320 and HGTl 16; 
(5) in breast tumor center HF-000545; (6) in kidney tumor center HF-00061 1 ; and (7) in testis 
tumor margin HF-000716 and testis tumor center HF-000733, PR01555 showed approximately 
1.04-4.99 AGt units which corresponds to 2^^-2'^^^ fold amplification or 2 fold to 32 -fold 
amplification in these tumors and tumor cell lines. Accordingly, the present specification clearly 
discloses overwhelming evidence that the gene encoding the PRO 1555 polypeptide is 
significantly amplified in a significant number of lung, colon, breast, kidney and testis tumors. 

The Examiner has asserted that "the specification provides data showing a very small 
increase in DNA copy number - about 2.3 fold - in many types of normal and cancerous tissue. 
(Page 5 of the Office Action mailed December 30, 2004). The Examiner has fiuther asserted that 
the specification "merely demonstrates that the PR01555 nucleic acid was ampUfied in some 
lung or colon cancers, to a minor degree (about 2.5 fold)." (Page 7 of the Office Action mailed 
December 30, 2004). 

Appellants submit that the Examiner seems have applied a heightened utility standard in 
this instance, which is legally incorrect. Appellants have shown that the gene encoding 
PR01755 demonstrated significant ampUfication, from 2.2 to 5.1 fold, in eight lung and colon 
tumors. As explained in the Declaration of Dr. Audrey Goddard (submitted with the Response 
filed August 31, 2004): 
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It is further my considered scientific opinion that an at least 2-fold increase in 
gene copy number in a tumor tissue sample relative to a normal (i.e., non-tumor) 
sample is significant and useful in that the detected increase in gene copy 
number in the tumor sample relative to the normal sample serves as a basis for 
using relative gene copy number as quantitated by the TaqMan PGR technique 
as a diagnostic marker for the presence or absence of tumor in a tissue sample of 
unknown pathology. (Emphasis added). 

By referring to the 2.2-fold to 5.1-fold amplification of the PR01555 gene in various 

tumors as "very small," or "minor." the Examiner appears to ignore the teachings within an 

expert's declaration without anv basis, or without presenting any evidence to the contrary . 

Appellants respectfully draw the Board's attention to the Utility Examination Guidelines (Part 

nB, 66 Fed. Reg. 1098 (2001)) which state that: 

"Office personnel must accept an opinion from a qualified expert that is based 
upon relevant facts whose accuracy is not being questioned; it is improper to 
disregard the opinion solely because of a disagreement over the significance or 
meaning of the facts offered". 

Thus, given the absence of any evidence to the contrary, Appellants maintain that the 2 to 
32-fold amplification disclosed for the PRO 1555 gene is sijgnificant and forms the basis for the 
utility claimed herein. 

The Examiner has asserted that, "No mutation or translocation of PR01555 has been 
associated with any type of cancer versus normal tissue. It is not know whether PR01555 is 
expressed in corresponding normal tissues, and what the relative levels of expression are." (Page 
7 of the Office Action mailed December 30, 2004). hi fact, Example 143 of the specification 
discloses that the gene amplification data were obtained by comparing DNA from a variety of 
primary tumors, including breast, lung, colon, rectum, kidney, testis, lymph node and parathyroid 
tumors, and various tumor cell lines with pooled DNA isolated firom the blood cells often 
healthy donors. Thus the expression level of PR01555 was tested in control, normal tissue. 

Appellants ftirther point out that the negative control taught in the specification was 
known in the art at the time of filing, and accepted as a true negative control as demonstrated by 
use in peer reviewed publications . For example, in Pitti et aL (Exhibit F submitted with the 
Response filed September 16, 2004), the authors used the same quantitative TaqMan PGR assay 
described in the specification to study gene amplification in lung and colon cancer of DcR3, a 
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decoy receptor for Fas ligand. As described, Pitti et al analyzed DNA copy number "in genomic 
DNA from 35 primary lung and colon tumours, relative to pooled genomic DNA fr om peripheral 
blood leukocytes (PBU of 10 healthy donors ." (Page 701, col. 1; emphasis added). The authors 
also analyzed mRNA expression of DcR3 in primary tumor tissue sections and found tumor- 
specific expression, confirming the finding of frequent amphfication in tumors, and confirming 
that the pooled blood sample was a valid negative control for the gene amplification experiments. 
In Bieche et al (Exhibit G submitted with the Response filed September 16, 2004), the authors 
used the quantitative TaqMan PGR assay to study gene amplification of myc, ccndl and erbB2 in 
breast tumors. As their negative control, Bieche et al used normal leukocyte DNA derived from 
a small subset of the breast cancer patients (page 663). The authors note that "[t]he results of this 
study are consistent with those reported in the literature" (page 664, col. 2), thus confmning the 
validity of the negative control. Accordingly, the art demonstrates that pooled normal blood 
samples are considered to be a valid negative control for gene amplification experiments of the 
type described in the specification. 

The Examiner has asserted that "[ajssays measuring DNA copy number found positive 
staining in about half the samples of several cancers, as well as about half the control samples. 
This inconsistent staining is not usefiil for diagnosing cancer, since a positive result does not 
indicate the presence of a cancer and a negative result does not indicate a lack of cancer." (Page 
1 of the Advisory Action mailed August 11, 2005). 

Appellants submit that the Examiner appears to have misunderstood the data presented in 
the specification. In the first place, the assay did not utilize staining, but a fluorescent PGR- 
based technique. Second, the specification explains, "As a negative control, DNA was isolated 
from the cells of ten normal healthy individuals which was pooled and used as assay controls for 
the gene copy in healthy individuals (not shown)" (page 494, lines 31-33). Appellants also wish 
to clarify that all of the samples listed in Table 8 are tumor samples, not normal controls . Those 
samples not described in Table 7 are tumor centers or tumor cell lines (page 494, line 37, to page 
495, line 2). There is nothing in the specification to indicate that the PR01755 gene was 
amplified in any control samples. 
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Applicants further emphasize that they have shown significant DNA ampUfication in 24 
of the tumor samples and tumor cell lines listed in Table 8, Example 143 of the instant 
specification. The fact that not all tumors tested positive in this study does not make the gene 
amplification data less significant. As any skilled artisan in the field of oncology would easily 
appreciate, not all tumor markers are generallv associated with everv tum or, or even with most 
tumors. For example, the article by Hanna and Momin (submitted with the Response filed 
August 31, 2004), discloses that the known breast cancer marker HER-2/neu is "amplified and/or 
overexpressed in 10%-30% of invasive breast cancers and in 40%-60% of intraductal breast 
carcinoma" (page 1, col. 1). 

hi fact, some tumor markers are usefiil for identifivdng rare malignancies . That is, the 
association of the tumor marker with a particular type of tumor lesion may be rare, or, the 
occurrence of that particular kind of tumor lesion itself may be rare. In either event, even these 
rare tumor markers which do not give a positive hit for most common tumors, have great value in 
tumor diagnosis, and consequently, in tumor prognosis . The skilled artisan would certainly know 
that such tumor markers are useful for better classification of tumors. Therefore, whether the 
PR01555 gene is amplified in 24 lung, colon, breast, kidney or testis tumors or in all lung, colon, 
breast, kidney or testis tumors is not relevant to its identification as a tumor marker, or its 
patentable utility. Rather, the fact that the amplification data for PR01555 is considered 
significant is what lends support to its usefulness as a tumor marker. If the goal is to diagnose 
lung, colon, breast, kidney or testis cancer, then contrary to the Examiner's assertion, a positive 
resuh does indicate the presence of cancer, while a negative result is not conclusive, and requires 
follow up testing. . 

The Examiner has asserted that "[o]ne cannot determine fi-om the data in the specification 
whether the observed 'amplification' of nucleic acid is due to increase in chromosomal copy 
number, or alternatively due to an increase in transcription rates." (Page 7 of the Office Action 
mailed December 30, 2004). Appellants note that the data in the specification relates to 
amplification of DNA . not mRNA, thus transcription rates would not affect this data. 
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The Examiner has further asserted that an increased number of chromosomes "is a very 
common characteristic of cancerous and non-cancerous epithelial cells" citing references by 
Hittelman and Crowell et al. in support. (Page 8 of the Office Action mailed March 18, 2004). 

Appellants respectfully submit that Hittelman and Crowell et al do not disclose DNA 
amplification in "normal" tissue or cells, but in tissue that has been damaged by carcinogenic 
agents and is closely associated with tumorous tissue . 

Appellants note that the title of the Hittelman paper is "Genetic Instabilities in Epithelial 
Tissues at Risk for Cancer." Hittelman studied lung tissue fi"om chronic smokers, which had 
been exposed for years to carcinogenic tobacco smoke. As Hittelman explains, "[tjumprs of the 
aerodigestive tract have been proposed to reflect a Tield cancerization' process whereby the 
whole tissue is exposed to carcinogenic insult (e.g., tobacco smoke) and is at increased risk for 
multistep tumor development (page 3). The detection of increases in chromosome number 
therefore identifies cells which have begun the first steps in this multistep progression to cancer. 
Even if these particular epithelial regions are not yet cancerous, their presence is strongly 
correlated with the development of cancer in the target tissue as a whole. Accordingly, 
Hittelman concludes that "the measurement of chromosome instability in the target tissue will be 
useful in assessing cancer risk as well as response to intervention" (page 10). 

The Crowell et al paper describes a similar study of lung tissue from smokers and former 
uranium miners. Crowell et al found that trisomy of chromosome 7 could be detected in 
nonmalignant bronchial epithelium from patients with lung cancer distant fi-om the site of the 
tumor and in individuals without tumors who are at high risk for lung cancer development (page 
632, col. 1) Crowell et al conclude fi-om these results that trisomy 7 "may be useful in early 
detection and intervention for lung carcinogenesis (page 636, col. 1). 

Accordingly, both Hittelman and Crowell et al show that an increase in chromosome 
number or gene amplification is associated not with normal tissues, but with cancerous, or pre- 
cancerous tissues, and therefore, an increase in chromosome number or gene amplification is a 
useful marker for a cancerous or pre-cancerous state. Detection of pre-cancerous cells or tissues 
is useful because, as explained by Hittelman, it allows for assessing cancer risk, as well as 
response to intervention. 
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Hence, Appellants respectfully submit that whether a pre-cancerous or tumor sample were 
analyzed, the showing of DNA amplification of the PR01755 gene would still be significant, 
since it would lead to the diagnosis of either a pre-cancerous state or a cancerous state, which is 
the utility asserted here. 

Accordingly, Appellants submit that based on the general knowledge in the art at the time 
the invention was made and the teachings in the specification, the specification provides clear 
guidance as to how to interpret and use the data relating to PR01555 nucleic acid expression and 
that the claimed nucleic acids which encode the PR01555 polypeptide have utility in the 
diagnosis of cancer. 

C. The Utility of the Claimed Nucleic Acids Does Not Depend Upon the 
Properties of the Encoded Polypeptide 

The Examiner has asserted that that the gene amplification data discussed above does not 
provide utility for the claimed PRO 1755 nucleic acids because "there is no evidence regarding 
whether or not PRO 1555 mRNA or protein levels are also increased in these cancers" that show 
amplification of the gene encoding PR01555. (Page 5 of the Office Action mailed December 30, 
2004). The Examiner has cited Pennica et al. in support of the assertion that "what is often seen 
is a lack of correlation between DNA ampUfication and increased peptide levels." (Page 4 of the 
Office Action mailed November 24, 2004). The Examiner also cited Haynes et al to the effect 
that "polypeptide levels cannot be accurately predicted firom mRNA levels." (Page 5 of the 
Office Action mailed December 30, 2004). 

Appellants respectfiiUy submit that the claims of the instant application are directed to 
nucleic acids, not polypeptides. Thus the question of whether gene amplification is associated 
with increased protein expression is irrelevant. The claimed nucleic acids have utility because 
they are amplified in limg and colon tumors, and thus may be used, for example, as diagnostic 
markers for lung or colon cancer. The expression level of the encoded polypeptide is irrelevant 
to this utility. 

^ The Examiner has fiarther asserted that "[f]urther research needs to be done to determine 
whether the small increase in PR01555 DNA supports a role for the peptide in the cancerous 
tissue." (Page 6 of the Office Action mailed December 30, 2004). 
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Appellants reiterate that that the claims are directed to nucleic acids , not polypeptides. 
Appellants* position regarding utility for these nucleic acids is based on the overwhehning 
evidence from gene (DNA) amplification data disclosed in the specification which clearly 
indicate that the gene encoding PR01555 is significantly amplified in certain lun g, colon, breast, 
kidney, and testis tumors . One of skill in the art would therefore understand how to use the 
claimed nucleic acids to detect ampHfication of the gene encoding PR01555 as a cancer 
diagnostic. 

The claimed nucleic acids can be used in cancer diagnosis without any knowledge 
regarding the function or cellular role of the encoded protein. Appellants submit that the law 
clearly states that "it is not a requirement of patentability that an inventor correctly set forth, or 
even know, how or why the invention works." Newman v. Quigg, 1 1 U.S.P.Q.2d 1340 (Fed. Cir. 
1989). Accordingly, the disclosure or identification of the mechanism by which PR01555 is 
associated with cancer is not required in order to establish the patentable utility of the claimed 
PR01555 nucleic acids. 

D. A prima facie case of lack of utility has not been established 

The Examiner has cited Hu et al, in support of the assertion that "the literature cautions 
researchers from drawing conclusions based on small changes in transcript levels." (Page 7 of 
the Office Action mailed December 30, 2004). 

Appellants submit that in order to overcome the presumption of truth that an assertion of 
utility by the applicant enjoys, the Examiner must establish that it is more likely than not that 
one of ordinary skill in the art would doubt the truth of the statement of utility. Accordingly, 
contrary to the Examiner's assertion. Appellants submit that Hu et al does not conclusively show 
that it is more likely than not that gene amplification is not correlated with the biological 
significance of cancer genes. First, the title of Hu et al is "Analysis of Genomic and Proteomic 
Data Using Advanced Literature Mining." As the title clearly suggests, the conclusion suggested 
by Hu et al is merely based on a statistical analysis of the information disclosed in the pubUshed 
literature. As Hu et al states, "We have utilized a computational approach to literature mining to 
produce a comprehensive set of gene-disease relationships." 
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In particular, Hu et al relied on the MedGene Database and the Medical Subject Heading 
(MeSH) files to analyze the gene-disease relationship. More specifically, Hu et al "compared 
the MedGene breast cancer gene list to a gene expression data set generated fi-om a micro-array 
analysis comparing breast cancer and normal breast tissue samples." (See page 408, right 
column). 

Therefore, Appellants first submit that the reference by Hu et al only studies the 
statistical analysis of micro-array data and not gene amplification data . Therefore, their findings 
would not be directly applicable to gene amplification data. In addition, Appellants respectfiiUy 
submit that the Hu et al reference does not show that a lack of correlation between microarray 
data and the biological significance of cancer genes is typical . 

According to Hu et al, ''different statistical methods" were applied to ''estimate the 
strength of gene-disease relationships and eyaluated the results:" (See page 406, left column, 
emphasis added). Using these different statistical methods, Hu et al "[a]ssessed the relatiye 
strengths of gene-disease relationships based on the fi-equency of both co-citation and single 
citation." (See page 41 1, left column). It is well known in the art that various statistical methods 
allow different variables to be manipulated to affect the outcome. For example, the authors 
admit, "Initial attempts to search the literature using" the Hst of geiies, gene names, gene 
symbols, and fi-equently used synonyms, generated by the authors "revealed several sources of 
false positives and false negatives." (See page 406, right column). The authors fiirther admit that 
the false positives caused by "duphcative and unrelated meanings for the term" were "difficult to 
manage." Therefore, in order to minimize such false positives, Hu et al disclose that these terms 
"had to be eliminated entirely, thereby reducing the false positive rate but unavoid ably under- 
representing some genes. " Id, Hence, Applicants respectfiiUy submit that in order to minimize 
the false positives and negatives in their analysis, Hu et al manipulated various aspects of the 
input data. 

Appellants fiirther submit that the statistical analysis by Hu et al is not a reliable standard 
because the firequency of citation reflects only the current research interest of a molecule rather 
than the true biological fimction of the molecule. Indeed, the authors acknowledge that 
"[r]elati6nship established by fi'equency of co-citation do not necessarily represent a true 
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biological link." (See page 411, right column). It often happens in scientific study that important 
molecules are overlooked by the scientific society for many years until the discovery of their true 
function. Therefore, Appellants submit that Hu et al drew their conclusions based on a very 
unreliable standard and that their research does not provide any meaningful information 
regarding the correlation between microarray data and the biological significance of a molecule. 

Even assuming that Hu et al provide evidence to support a true relationship, the 
conclusion in Hu et al only applies to a specific type of breast tumor (estrogen receptor (ER)- 
positive breast tumor) and can not be generalized as a principle governing microarray study of 
breast cancer in general, let alone the various other types of cancer genes in general . In fact, even 
Hu et al admit that, "[i]t is likely that this threshold will change depending on the disease as well 
as the experiment. Interestingly, the observed correlation was only found among ER-positive 
(breast) tumors not ER-negative tumors." (See page 412, left column). Therefore, based on 
these findings, the authors add, "This may reflect a bias in the literature to study the more 
prevalent type of tumor in the population. Furthermore, this emphasizes that caution must be 
taken when interpreting experiments that may contain subpopulations that behave very 
differently." Id. (Emphasis added). 

In summary. Appellants respectfully submit that the Examiner has not shown that a lack 
of correlation between microarray data and the biological significance of cancer genes, as 
observed for ER-positive breast tumor, is typical . Since the standard is not absolute certainty, a 
prima facie showing of lack of utility has not been made in this instance. The Patent Office has 
failed to meet its initial burden of proof that Appellants' claims of utility are not substantial or 
credible. The arguments presented by the Examiner in combination with the Hu et al article do 
not provide sufficient reasons to doubt the statements by Appellants that PR01555 has utility. 
Therefore, Appellants submit that the Examiner's reasoning is based on a misrepresentation of 
the scientific data presented in the above cited reference and application of an improper, 
heightened legal standard. 

For the reasons given above. Appellants respectfully submit that the present specification 
clearly describes, details and provides a patentable utiUty for the claimed invention. 
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Accordingly, Appellants respectfully request reconsideration and reversal of the rejections of 
Claims 33, 38-40 and 44-54 under 35 U.S.C. §101. 



ISSUE II: Claims 33, 38-40 and 44-54 satisfy the enablement requirement of 35 U SC S112, 
first paragraph. 

Claims 33, 38-40 and 44-54 stand rejected under 35 U.S.C. §112, first paragraph, allegedly 
"since the claimed invention is not supported by either a specific and substantial asserted utility or a 
well established utility ... one skilled in the art clearly would not know how to use the claimed 
invention." (Page 4 of the Office Action mailed December 30, 2004). 

In this regard. Appellants refer to the arguments and information presented above in 
response to the outstanding rejection under 35 U.S.C. §101, wherein those arguments are 
incorporated by reference herein. Appellants respectfiiUy submit that as described above, the 
nucleic acids encoding the PRO 155 5 polypeptide have utility in the diagnosis of cancer and 
based on such a utility, one of skill in the art would know exactly how to use the claimed nucleic 
acids for diagnosis of cancer, without undue experimentation. One of skill in the art would also 
know exactly how to use the claimed fragments of SEQ ID NO:337, for example as probes to detect 
gene amplification of SEQ ID NO:337 (encoding PR01555) wherein such gene ampUfication can 
be used as a diagnostic marker for lung and colon tumors, 

Accordingly, Appellants respectfully request reconsideration and reversal of the 
enablement rejection of Claims 33, 38-40 and 44-54 under 35 U.S.C. §112, first paragraph. 

CONCLUSION 

For the reasons given above. Appellants submit that the specification discloses at least 
one patentable utility for the PR01555 nucleic acids of Claims 33, 38-40 and 44-54, and that one 
of ordinary skill in the art would understand how to used the claimed nucleic acids, for example 
in the diagnosis of lung and colon tumors. Therefore, claims 33, 38-40 and 44-54 meet the 
requirements of 35 USC §101 and 35 USC §1 12, first paragraph. 

Accordingly, reversal of all the rejections of claims 33, 38-40 and 44-54 is respectfiiUy 
requested. 
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Please charge any additional fees, including fees for additional extension of time, or 
credit overpayment to Deposit Account No. 08-1641 (referencing Attorney's Docket 
No. 39780-2830 P1C63) . 

Respectfully submitted, 



Date: August 25, 2005 By: /7»n^ 

Barrie D. Greene (Reg. No. 46,740) 

HELLER EHRMAN LLP 

275 Middlefield Road 
Menlo Park, California 94025-3506 
Telephone: (650)324-7000 
Facsimile: (650)324-0638 
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8. CLAIMS APPENDIX 

Claims on Appeal 

33. An isolated nucleic acid comprising: 

(a) the nucleic acid sequence of SEQ ID NO:337; 

(b) the full-length coding sequence of the nucleic acid sequence of SEQ ED NO:337; 

or 

(c) the full-length coding sequence of the cDNA deposited under ATCC accession 
number 203322. 

38. The isolated nucleic acid of Claim 33 comprising the nucleic acid sequence of 
SEQIDNO:337. 

39. The isolated nucleic acid of Claim 33 comprising the full-length coding sequence 
of the nucleic acid sequence of SEQ ID NO:337. 

• 40. The isolated nucleic acid of Claim 33 comprising the full-length coding sequence 
of the cDNA deposited under ATCC accession number 203322. 

44. A vector comprising the nucleic acid of Claim 33. 

45 . The vector of Claim 44, wherein said nucleic acid is operably linked to control 
sequences recognized by a host cell transformed with the vector. 

46. An isolated host cell comprising the vector of Claim 44. 

47. The host cell of Claim 46, wherein said cell is a CHO cell, an E. coli or a yeast 

cell. 

48. An isolated nucleic acid molecule consisting of an at least 20 nucleotides 
fragment of the nucleic acid sequence of SEQ ID NO:337, or a complement thereof, that 
specifically hybridizes under stringent conditions to: 
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(a) the nucleic acid sequence of SEQ ID NO:337 or a complement thereof; 

(b) the full-length coding sequence of the cDNA deposited under ATCC accession 
number 203322 or a complement thereof; 

wherein, said stringent conditions use 50% formamide, 5 x SSC, 50 mM sodium 
phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5x Denhardt's solution, sonicated sahnon 
sperm DNA (50 /ig/ml), 0.1% SDS, and 10% dextran sulfate at 42 ^C, with washes at 42 °C in 
0.2 X SSC and 50% formamide at 55 ^C, followed by a wash comprising of 0.1 x SSC containing 
EDTA at 55 "C, wherein said isolated nucleic acid molecule is suitable for use as a PGR primer 
or probe. 

49. The isolated nucleic acid molecule of Claim 48 that is at least 50 nucleotides. 

50. The isolated nucleic acid molecule of Claim 48 that is at least 60 nucleotides. 

5 1 . The isolated nucleic acid molecule of Claim 48 that is at least 70 nucleotides. 

52. The isolated nucleic acid molecule of Claim 48 that is at least 80 nucleotides. 

53. The isolated nucleic acid molecule of Claim 48 that is at least 90 nucleotides. 

54. The isolated nucleic acid molecule of Claim 48 that is at least 100 nucleotides. 
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9. EVIDENCE APPENDIX 

1. Declaration of Audrey D. Goddard, Ph.D. under 37 C.F.R. §1.132, with attached Exhibits 
A-G: 

A. Curriculum Vitae of Audrey D. Goddard, Ph.D. 

B. Higuchi, R. et al., "Simultaneous amplification and detection of specific DNA 
sequences," Biotechnology 10:413-417 (1992). 

C. Livak, K.J., et al., "Oligonucleotides with fluorescent dyes at opposite ends provide a 
quenched probe system useful for detecting PGR product and nucleic acid hybridization," PCR 
Methods Appl. 4:357-362 (1995). 

D. Heid, C.A. et al., "Real time quantitative PCR," Genome Res. 6:986-994 (1996). 

E. Pennica, D. et al, "WISP genes are members of the connective tissue growth factor 
family that are up-regulated in Wnt-1 -transformed cells and aberrantly expressed in human colon 
tumors," Proc. Natl. Acad. Sci. USA 95:14717-14722 (1998). 

F. Pitti, R.M. et al., "Genomic amplification of a decoy receptor for Fas ligand in lung and 
colon cancer," Nature 396:699-703 (1998). 

G. Bieche, I. et al., "Novel approach to quantitative polymerase chain reaction using real- 
time detection: Application to the detection of gene ampUfication in breast cancer," Int. J. Cancer 
78:661-666 (1998). 

2 . Hittelman, W., "Genetic instabihty in epithelial tissues at risk for cancer," Ann. NY Acad 
5ci. 952:1-12 (2001). 

3. Crowell, et al., "Detection of trisomy 7 in nonmahgnant bronchial epithelium fi-om lirng 
cancer patients and individuals at risk for limg cancer," Cancer Epidemiol. 5:631-637 (1996). 

4. Pennica, D. et al., "WISP genes are members of the connective tissue growth factor family 
that are up-regulated in Wnt-1 -transformed cells and aberrantly expressed in human colon tumors," 
Proc. Natl. Acad Sci. USA 95:14717-14722 (1998). 

5. Haynes, P.A., et al., "Proteome analysis: Biological assay or data archive?" Electrophoresis 
19:1862-1871 (1996). 

-23- . 

Appeal Brief 
Application Serial No. 10/017,527 
Attorney's Docket No. 39780-2830 P1C63 



6. Hu, Y. et al., "Analysis of genomic and proteomic data using advanced literature mining," 
Journal ofProteome Research 2:405-412 (2003). 

Item 1 was submitted with Appellants' Response filed September 16, 2004, and made of record by 
the Examiner in the Office Action mailed December 30, 2004. 

Items 2-3 were made of record by the Examiner in the Office Action mailed March 18, 2004. 
Items 4-6 were made of record by the Examiner in the Office Action mailed December 30, 2004. 
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PATENT 



m THE UMTED STATES PATENT AND TRADEMARK OFFICE 



In re Application of: Ashkenazi et aL 

Serial No.: 09/903,925 

Filed: July 11, 2001 

For SECRETED AND 

TRANSMEMBRANE 
POLYPEPTIDES AND NUCLEIC 
ACIDS 



Group Art Unit: 1647 



Examiner: Fozia Hamid 



DECLARATION OF AUDREY J). GODD Am Ph,D UNDER 37 CRIL S 1^132 

Assistant Gommissioner of Patents 
Washington, D.C. 20231 



Sin ; ■■■''."*■. 

I, Audrey D. Goddard, PLD. do hereby declare and say as follows: 

1 , I am a Senior Qinical Scientist at the Experimental Medicine/BioOncology, Medical 
Affairs Department of Genentech, Inc., South San Francisco, California 94080. 

2, Between 1993 and 200 1 , 1 headed the DNA Sequencing Laboratory at flie Molecular 
Biology Department of Genentech, Inc, During this time, my responsibilities included the 
identification and characterization of genes contributing to the oncogenic process, and determination 
of the chromosomal localization of novel genes. 

3, My scientific CumculimiVitae, including my Ust of publicatioiis, is 
fonns part of this Declaration (Exhibit A). 



Serial No.: * 
Filed;* 

4. I am familiar with a variety of techniques known in the art for detecting and 
quantifying the amplification of oncogenes in cancer, including the quantitative TaqMan PGR (ie,, 
**gene amplification") assay described in the above captioned patent application. 

5/ The TaqMan PGR assay is described, for example, in the following scientific 
publications: Higuchi et al.. Biotechnology 10:413-417 (1992) (Exhibit B); Livak ei al., P^ 
Methods AppL. 4:357-362 (1995) (Exhibit C> and Heid et ai. Genome Res. 6:986-994 (1996) • : 
(Exhibit D). Briefly, the assay is based on the principle that successfiil PGR yields a fluorescent 
signal due to Taq DNA polymerase-mediated exonucleiise digestion of a fluorescently labeled 
oligonucleotide that is homologous to a sequence between two PGR primers. The extent of 
digestion depends directiy on the amount of PGR, and can be quantified accun^ 
increment in fluorescence that results from decreased energy transfer. This is an extremely sensitive 
technique, which allows detection in the exponential phase of the PGR reaction and, as a result, 
leads to accurate detennination of gene copy number. 

6. The quantitative fluorescent TaqMan PGR assay has been extensively and • 
successfiilly used to characterize genes involved in cancer developmerit and progression. 
Amplification of protooncogenes has been studied in a variety of human tumors, and is widely 
considered as having etiological, diagnostic and prognostic significance. This use of the quantitative 
TaqMan PGR assay is exemplified by the following scientific publications: Pennica et al., Proc. 
Natl. Acad. Sci. USA 95(25): 147 17-14722 (1998) (Exhibit E); Pitti et aL, Nature 
^Q6(671 7):699^7Q3 (1998;> fExhibit and Biecfae et aL Int. J. Gancer 78:661-666 (1998) (Exhibit 
G), the first two of which I am co-author. In particular, iPennica et al have used the quantitative 
TaqMan PGR assay to study relative gene amplification of WISP and c-myc in various cell lines, 
colorectal tumors and normsil mucosa. Pitti et cd: studied the genomic amplification of a decoy 
receptor for Fas ligand in lung and colon cancer, using the quantitative Taqjvlan PGR assay. Bieche 
et ai used the assay to study gene amplification in breast cancer. 
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7. It is my personal experience that the quantitative TaqMan PGR technique is 
technically sensitive enough to detect at least a 2-fold increase in gene copy number relative to 
control. It is further my considered scientific opinion that an at least 2-fold increase in gene copy: 
number in a tumor tissue sample relative to a noraial (i:e,, non-tumor) sample is significant and 
usefiil in that the detected increase in gene copy number in the tumor sample relative to the normal 
isaraple serves as a basis for using relativie gene copy number as quantitated by the TaqMan PGR 
technique as a diagnostic marker for the presence or absence of tumor in a tissue sample of unknown * 
pathology. Accordingly, a gene identified as being amplified at least 2-fold by the quantitative 
TaqMan PGR assay in a tumor sample relative to a normal sample is usefiil as a marker for the 
diagnosis of cancer, for monitoring cancer development and/or for measuring the eflScacy of cancer 
thersqjy. 

8. r declare further that all statements naade herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true. I declare that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 1 8 of the United States 
Gode, and that such willful false statements may jeopardize the validity of the application or any 
patent issuing thereoa 

I^ate. Audrey D.Goddard,Ph.D. 
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AUDREY D. GODDARD, Ph.D. 



Genentech. Inc. 
1 DNAWay 

South San Francisco, OA, 94080 

650,225,6429 

goddarda@gene.com 



110 Congo St. 

San Francisco. CA. 94131 

415,841.9154 

415.819,2247 (mobile) 

agoddard@pacbell.net 



PROFESSIONAL EXPERIENCE 



1993-present 



Genentech, Inc. 
South San Francisco, CA 

2001 -present Senior Clinical Scientist 
Experimental Medicine / BidOncology, Medical Affairs 

Responsibilities: 

• Companion diagnostic oncology products 

• Acquisition of clinical samples from Genentech's clinical trials for translational research 

• Translational researcti using clinical specimen and data for drug development and 
diagnostics 

• Member of Development Science Review Committee, Diagnostic Oversight Team, 21 CFR 
Part 11 Subteam 

Interests: 

• Ethical and legal implications of experiments with clinical specimens and data 
- • Application of phartnacogenomics in clinical trials 



1998-2001 Senior Scientist 

Head of the DNA Sequencing Laboratory. Molecular Biology Department, Research 
Responsibilities: 

• Management of a laboratory of up to nineteen -including postdoctoral fellow, associate 
scientist, senior research associate and research assistants/associate levels 

• Management of a $750K budget 

• DAM sequencing core facility supporting a 350+ person research facility. 

• DNA sequencing for high throughput gene discovery, - ESTs, cDNAs, and constructs 

• Genomic sequence analysis and gene identification 

• DNA sequence and primary protein analysis 

Research: 

9 Chromosomal localization of novel genes 

• Identification and characterization of genes contributing to the oncogenic process 

• Identification and characterization of genes contributing to inflammatory diseases 

• Design and development of schemes for high throughput genomic DNA sequence analysis 

• Candidate gene prediction and evaluation 
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1993 - 1998 Scientist 

Head of the DNA Sequencing Laboratory, Molecular Biology Department, Research 
Responsibilities 

• DNA sequencing core facility supporting a 350+ person researcf) facility 

• Assumed responsibility for a pre-existing team of five technicians and expanded the group 
into fifteen, introducing a level of middle management and additional areas of research 

• Participated in the development of the basic plan for high throughput secreted protein 
discovery program - sequencing strategies, data analysis and tracking, database design 

• High throughput EST and cDNA sequencing for new gene identification. 

• Design and implementation of analysis tools required for high throughput gene identificatio 

• Chromosomal localization of genes encoding novel secreted proteins. 

Research: 

• Genomic sequence scanning for new gene discovery. 

• Development of signal peptide selection methods. 

• Evaluation of candidate disease genes. 

• Growth hormone receptor gene SNPs in children with Idiopathic short stature 

Imperial Cancer Research fund 1989-1992 
London, UK with Dr. Ellen Solomon 

6/89 -12/92 Postdoctoral Fellow 

• Cloning and characterization of the genes fused at the acute promyelbcytic leukemia 
translocation breakpoints on chromosomes 17 and 15. 

• Prepared a successfully funded European Union multi-center grant application 

MclWaster University 1983 
Hamilton, Ontario, Canada with Dr. G. D. Sweeney 

5/83 - 8/83: NSERC Summer Student 

• In vitro metabolism of p-naphthoflavone in C57BI/6J and DBA mice 



EDUCATION 



"Phenotypic and genotypic effects of mutations in 
the human retinoblastoma gene." 
Supervisor: Dr. R. A. Phillips 



Ph.D. 



University of Toronto 
Toronto, Ontario, Canada. 
Department of Medical 
Biophysics. 



1989 



Honours B.Sc 

•The in vitro metabolism of the cytochrome P-448 
inducier p-naphttioflavone in C57BL/6J mice." 
Supervisor: Dr. G. D. Sweeney 



McMaster University, 
Hamilton, Ontario, Canada. 
Department of Biochemistry 



1983 
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ACADEMIC AWARDS 

Imperial Cancer Research Fund Postdoctoral Fellowship 
Medical Research Council Studentship 
NSERC Undergraduate Summer Research Award 
Society of Chemical Industry Merit Award (Hons. Biochem,) 
Dr. Harry Lyman Hooker Scholarship 
J.LW, Gill Scholarship 

Business and Professional Women's Club Scholarship 
Wyeriiauser Foundation Scholarship 



INVITED PRESENTATIONS 

Genentech's gene discovery pipeline: High throughput identification, cloning and 
characterization of novel genes. Functional Genomics: From Genome to Function, Litchfield 
Parte, AZ. USA, October 2000 

High throughput identification, cloning and characterization of novel genes. G2K:Back to 
Science. Advances in Genome Biology and Technology I. Marco Island, FL, USA. February 
2000 

Quality control in DNA Sequencing: The use of Phred and Phrap. Bay Area Sequencing 
Users Meeting, Beri<eley, CA, USA. April 1999 

High throughput secreted protein identification and cloning. Tenth International Genome 
Sequencing and Analysis Conference, Miami, FL, USA. September 1998 

The evolution of DNA sequencing: The Genentech. perspective. Bay Area Sequencing Users 
Meeting. Beri<eley, OA, USA. May 1998 

Partial Growth Hormone Insensitivity: The role of GH-receptor mutations in Idiopathic Short 
Stature. Tenth Annual National Cooperative Growth Study Investigators Meeting. San 
Francisco, CA, USA. October, 1996 

Growth hormone (GH) receptor defects are present in selected children with non-GH-deficient 
short stature: A molecular basis for partial GH-insensitivity. 76*^ Annual Meeting of The 
Endocrine Society. Anaheim. CA. USA. June 1 994 , 

A previously uncharacterized gene, myl. is fused to the retinoic acid receptor alpha gene in 
acute promyelocytic leukemia. XV International Association for Comparative Research on 
Leukemia and Related Disease, Padua, Italy. October 1991 



1989-1992 
1983-1988 
1983 
1983 

1981-1983 
1981-1982 
1980-1981 
1979-1980 
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PATENTS 

Goddard A, Godowski PJ. Gurney AL. NL2 Tie ligand homologue polypeptide. Patent 
Number: 6.455.496. Date of Patent: Sept. 24, 2002. 

Goddard A, Godowski PJ and Gurney AL. NL3 Tie ligand homologue nucieic acids. Patent 
Number: 6.426,218. Date of Patent: July 30, 2002. 

Godowski P. Gumey A, Hillan KJ, Botstein D, Goddard A. Roy M, Fen-ara N. Tumas D. 
Schwall R: NL4 Tie ligand homologue nucleic add. Patent Number: . 6.4 137,770. Date of 
Patent: July 2. 2002. 

Ashkenazi A, Fong S, Goddard A, Gurney AL. Napier MA. Tumas D. Wood Wl. Nucleic acid 
encoding A-33 related antigen poly peptides. Patent Number 6,410,708. Date of Patent: 
Jun. 25, 2002. , 

Botstein DA. Cohen RL. Goddard AD, Gumey AL, Hillan KJ. Lawrence DA, Levine AJ, 
Pennica D, Roy MA and Wood Wl. WISP polypeptides and nucleic jadds encoding same: 
Patent Number 6,387,657, Date of Patent: May 14, 2002. 

Goddard A, Godowski PJ and Gumey AL Tie ligands. Patent Number 6,372.491. Date of 
Patent April 16. 2002. 

Godowski PJ. Gurney AL. Goddard A and Hillan K. TIE ligand homologue antibody. Patent 
Number 6.350,450. Date of Patent Feb. 26. 2002. 

Fong S. Ferrara N. Goddard A. Godowski PJ. Gumey AL. Hillan K and Williams PM. Tie 
receptor tyrosine kinase ligand homologues. Patent Number 6.348.351 . Datie of Patent 
Feb. 19,2002. 

Goddard A, Godowski PJ and Gurney AL. Ligand homologues. Patent Number 6.348,350. 

Date of Patent Feb. 19. 2002. 

Attie KM, Carlsson LMS, Gesundheit N and Goddard A. Treatment of partial growth 
hormone insensitivity syndrome. Patent Number 6,207,640. Date of Patent March 27, 
2001. . 

Fong S. Ferrara N. Goddard A, Godowski PJ, Guraey AL. Hillan K and Williarhs PM. Nudeic 
adds encoding NL-3. Patent Number 6.074,873. Date of Patent June 13. 2000 

Attie K. Carlsson LMS. Gesunheit N and Goddard A Treatment of partial growth hormone 
insensitivity syndrome. Patent Number: 5.824.642; Date of Patent October 20. 1998 

Attie K, Carlsson LMS. Gesunheit N and Goddard A. Treatment of partial growth honnone 
insensitivity syndrome. Patent Number 5,646.1 13, Date of Patent July 8. 1997 

Multiple additional provisional applications filed 




Audrey D. Goddard, Ph.D. . . . page 5 of 9 

PUBLICATIONS 

Seshasayee D. Dowd P. Gu Q, Erickson S, Goddard AD Comparative sequence analysis of 
the HER2 locus in mouse and man. Manuscript in preparation. 

Abuzzahab MJ, Goddard A, Grigorescu F, Lautier C. Smith RJ and Chemausek SD. Human 
IGF-1 receptor mutation^ resulting in pre- and post-natal growth retardation. Manuscript in 
preparation. 

Aggarwal S. Xie, M-H. Foster J. Frantz G, Stinson J, Corpuz RT, Simmons L, Hillan K, 
Yansura DG. Vandlen RL, Goddard AD and Gurney AL. FHFR, a novel receptor for the 
fibroblast growth factors. Manuscript submitted. 

Adams SH, Chui C, Schilbach SL, Yu XX, Goddard AD, Grimaldi JC, Lee J, Dowd P, Colman 
S.. Lewin DA. (2001) BFIT, a unique acyl-CoA thioesterase induced in thiermogenic brown 
adipose tissue: Cloning, organization of the human gene, and assessment of a potential link 
to obesity. S/ocfte/n/ca/ Jouma/ 360: 135-142: 

Lee J. Ho WH, Maruoka M. Cprpuz RT. Baldwin DT. Foster JS. Goddard AD. Yansura DG. 
Vandlen RL. Wood Wl. Gumey AL. (2001) IL-17E, a novel proinflammatory ligand for the IL- 
1 7 receptor homolog IL-1 7Rh1 . Journal of Biological Chemistry 276(2): 1 660-1 664. 

Xie M-H, AgganA/al S, Ho W-H, Foster J. Zhang Z, Stinson J, Wood Wl, Goddard AD and 
Gumey AL. (2000) Interieukin (IL)-22. a novel human cytokine that signals through the 
interferon-receptor related proteins CRF2-4 and IL-22R. Journal of Biological Chemistry 215': 
31335-31339: 

Weiss GA. Watanabe CK, Zhong A, Goddard A and Sidhu SS. (2000) Rapid mapping of 
protein functional epitopes by combinatorial alanine scanning. Proc. Nati Acad, ScL USA 97: 
8950-8954. 

Guo S, Yamaguchi Y. Schilbach S, Wada t.;Lee J, Goddard A, French D , Handa H, 
Rosenthal A. (2000) A regulator of transcriptional elongation controls vertebrate neuronal . 
development. A/afure 408: 366-369, 

Yan M, Wang L-C, Hymowitz SG, Schilbach S. Lee J, Goddard A, de Vos AM. Gao WQ. Dixit 
VM. (2000) Two-arhino acid molecular switch in an epithelial morphogen that regulates 
binding to two distinct receptors. Sc/ence 290: 523-527: 

Sehl PD, Tai JTN, Hillan KJ. Brown LA. Goddard A, Yang R. Jin H and Lowe DG. (2000) 
Application of cDNA microarrays in determining molecular phenotype in cardiac growth,, 
development, and response to injury. Circulation 101: 1990-1999. 

Guo S, Brush J, Teraoka H, Goddard A. Wilson SW. Mullins MC and Rosenthal A. (1999) 
Development of noradrenergic neurons in the zebrafish hindbrain requires BMP, FGF8, and 
the homeodomain protein soulless/Phox2A. A/euron 24: 555-566. 

Stone D, Murone, M, Luoh, S, Ye W. Armanini P, Gurney A, Phillips HS, Brush, J, Goddard 
A, de Sauvage FJ and Rosenthal A. (1999) Characterization of the human suppressor of 
fused; a negative regulator of the zinc-finger transcription factor Gli. J. Cell Sci. 112: 4437- 
4448. 

Xie M-H, Holcomb I, Deuel B, Dowd P, Huang A, Vagts A. Foster J, Liang J, Bmsh J, Gu Q, 
Hillan K, Goddard A and Gurney, A.L. (1999) FGF-19, a novel fibroblast growth factor with 
unique. specificity for FGFR4. Cyto/c/ne 11: 729-735. . 
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Yan M. Lee J, Schilbach S, Goddard A and Dixit V, (1999) mE10. a novel caspase 
recruitment domain-containing proapoptotic molecule, J. Biol. Chem. 274(15): 10287-10292. 

Gurney AU, Marsters SA, Huang RM, Pitti RM, Mark DT. Baldwin DT. Gray AM, Dowd P. 
Brush J, Heldens S, Schow P, Goddard AD, Wood Wl, Baker KP, Godowski PJ and 
Ashkenazi A, (1999) Identification of a new member of the tumor necrosis factor family and its 
receptor, a human ortholog of mouse GITR, Current Biology 9(4): 215-218. 

Ridgway JBBi, Ng E. Kern JA .Lee J. Bmsh J, Goddard A and Carter P. (1999) Identification 
of a human anti-CD55 single-chain Fv by subtractive panning of a phage library using tumor 
and nontumdr cell Knes. Cancer Research 59: 2718-2723. 

Pitti RM, Marsters SA, Lawrence DA, Roy M, Kischkel FC. Dowd P, Huang A, Donahue CJ, 
Sherwood SW. Baldwin DT, Godowski P J, Wood Wl, Gumey AL, Hillan KJ. Cohen RL, 
Goddard AD, Botsteip D and Ashkenazi A, (1998) Genomic amplification of a decoy receptor 
for Fas ligand in lung and colon cancer. Nature 396(6712): 699-703. 

Pennica D. Swanson TA, Welsh JW, Roy MA, Lawrence DA, Lee J, Brush J, Taneyhill LA. 
Deuel B, Lew M. Watanabe C. Cohen RL. Melhem MF, Finley GG, Quirke P, Goddard AD, 
Hillan KJ, Gumey AL. Botstein D and Levine AJ, (1998) WISP genes are members of the 
connective tissue growth factor family that are up-regulated in wnt-1 -transformed cells and 
aberrantly expressed in human colon tumors, Proc, NatL Acad. Sci. USA. 95(25): 14717- 
14722. 

Yang RB. Mark MR, Gray A, Huang A, Xie MH, Zhang M. Goddard A, Wood Wl, Gumey AL 
and Godowski PJ. (1998) Toll-like receptor'-2 mediates lipopolysaccharide-induced cellular 
signalling. A/afare 395(6699): 284-288. 

Merchant AM, Zhu Z. Yuan JQ. Goddard A, Adams CW. Presta LG and Carter P. (1998) An 
effident route to human bispedfic IgG. A/afure 8/ofec/?no/pgy 16(7)^ 

Marsters SA, Sheridan JP, Pitti RM, Brush J, Goddard A and Ashkenazi A. (1 998) 
Identification of a ligand for the death-domain-containing receptor Apo3. Current Biology 8(9): 
525-528. 

Xie J. Murone M. Luoh SM. Ryan A, Gu Q, Zhang C. Bonifas JM, Lam CW. Hynes M, 
Goddard A, Rosenthal A, Epstein EH Jr. and de Sauvage FJ. (1998) Activating Smoothened 
mutations in sporadic basal-cell cardnoma. A/afure. 391(6662): 90-92, 

Marsters SA, Sheridan JP. Pitti RM, Huang A, Skubatch M, Baldwin D, Yuan J. Gumey A, 
Goddard AD. Godowski P and Ashkenazi A, (1 997) A novel receptor for Apo2L/TRAIL 
contains a truncated death domain. Current Biology. 7(1 2): 1 003-1 006. 

Hynes M. Stone DM, Dowd M. Pitts-Meek S, Goddard A. Gurney A and Rosenthal A. (1997) 
Control of cell pattern in the neural tube by the zinc finger transcription factor G//-1 Neuron 
19:15-26, 

Sheridan JP, Marsters SA, Pitti RM, Gumey A., Skubatch M. Baldwin D, Ramakrishnan L, 
Gray CL, Baker K, Wood Wl. Goddard AD, Godowski P, and Ashkenazi A. (1997) Control of 
TRAIL-lnduced Apoptosis by a Family of Signaling and Decoy Receptors. Science 277 
(5327): 818-821. 
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Goddard AD. Dowd P. Chernausek S. Geffner M, Gertner J, Hintz R. Hopwood N, Kaplan S, 
Plotnick L. Rogol A, Rosenfield R, Saenger P. Mauras N, Hershkopf R, Angulo M and Attie, K. 
(1997) Partial growth homione Insensitivity: The role of growth hormone receptor mutations In 
idiopathic short stature. J. Pecf/a<r. 131: S51 -55. 

Klein RD. Sherman D, Ho WH, Stone D, Bennett GL. Moffat B, Vandlen R, Simmons L. Gu Q, 
Hongo JA. Devaux B, Poulsen K, Armanini M, Nozaki C. Asai N, Goddard A, Phillips H, 
Henderson CE, Takahashi M and Rosenthal A. (1997) A GPI-linked protein that interacts with 
Ret to fomi a candidate neurturin receptor. Nature. 387(6634): 717-21 . 

Stone DM, Hynes M. Armanini M. Swanson TA. Gu Q, Johnson RL, Scott MP, Pennica D, 
Goddard A. Phillips H, Noll M, Hooper JE. de Sauvage F and Rosenthal A. (1996) The 
tumour-suppressor gene patched encodes a candidate receptor for Sonic hedgehog. Nature 
384(6605): 129-34. 

Mareters SA, Sheridan JP, Donahue CJ, Pitti RM. Gray CL, Goddard AD, Bauer KD and 
Ashkenazi A. (1996) Apo-3, a new member of the tumor necrosis factor receptor family, 
contains a death domain and aotiyates apoptosis and NF-kappa p. Current Biology 6(12): 
1669-76. 

Rothe M. Xiong J. Shu HB. Williamson K, Goddard A and Goeddel DV. (1 996) l-TRAF is a 
novel TRAF-interacting protein that regulates TRAF-medlated signal transduction. Proc. Natl. 
Acad Sc/. (ySA 93: 8241-8246. 

Yang M, Luoh SM, Goddard A, Reilly D. Henzel W and Bass S. (1 996) The bglX gene 
located at 47.8 min on the Escherichia coli chromosome encodes, a periplasmic beta- 
glucosidase. Mcrot/o/ogy 142: 1659-65. 

Goddard AD and Black DM. (1996) Familial Cancer in Molecular Endocrinoloov of Cancer. 
Waxman, J. Ed. Cambridge University Press. Cambridge UK, pp.1 87-21 5. 

Treanor JJS, Goodman L, de Sauvage F, Stone DU, Poulson KT, Beck CD, Gray C, Armanini 
MP. Pollocks RA. Hefti F, Phillips HS. Goddard A, Moore MW. Buj-Bello A, Davis AM. Asai N. 
Takahashi M. Vandlen R, Henderson CE and Rosenthal A. (1996) Characterization of a 
receptor for GDNF. A/afure 382: 80-83. 

Klein RD, Gu Q, Goddard A and Rosenthal A. (1996) Selection for genes encoding secreted 
proteiriis and receptors. Proc. Natl. Acad. Sci. USA 93: 71.08-71 ^Z. 

Winslow JW. Moran P, Valverde J. Shih A. Yuan JQ, Wong SC, Tsai SP, Goddard A, Henzel 
WJ. Hefti F and Caras I. (1995) Cloning of AL-1, a ligand for an Eph-related tyrosine kiriase 
receptor irivolved in axon bundle formation. Neuron 14: 973-981 . 

Bennett BD, Zeigler FC, Gu Q, Fendly B, Goddard AD, Gillett N and Matthews W. (1995) 
Molecular cloning of a ligand for the EPH-related receptor protein-tyrosine kinase Htk. Proc. 
A/af/. Acad Sc/. L/SA 92: 1866-1870. 

Huang X, Yuang J, Goddard A. Foulis A, James RF. Lemmarit A, Pujol-Bon-ell R, 
Rabinovitch A, Somoza N and Stewart TA. (1995) Interferon expression in the pancreases of 
patients with type i diabetes. Diabetes 44: 658-664. 

Goddard AD, Yuan JQ, FairiDaim L, Dexter M. Borrow J, Kozak C and Solomon E. (1995) 
Cloning of the murine homolpg of the leukemia-associated PML gene. Mammalian Genome 
6:732-737. 
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Goddard AD. Covello R, Luoh SM. Clackson T, Attie KM, Gesundheit N. Rundle AC, Wells 
JA. Carlsson LMTI and The Growth Hormone Insensitivity Study Group. (1995) Mutations of 
the growth hormone receptor in children with idiopathic short stature. N. Engl. J. Med. 333: 
1093-1098. 

Kuo SS. Moran P. Gripp J. Armanini M, Phillips HS, Goddard A arid Caras IW. (1994) 
Identification and characterization of Batk. a predominantly brain-spedfic non-receptor protein 
tyrosine kinase related to Csk. J. Net/rose/. Res. 38: 705-715. 

Mark MR, Scadden DT. Wang Z. Gu Q, Goddard A and Godowski PJ. (1994) Rse. a novel 
receptor-type tyrosine kinase with homology to Axl/Ufo, is expressed at high levels in the 
brain. Jouma/ of e/o/og/ca/CAiem/siiy 269: 10720-1 G728. 

Borrow J, Shipley J, Howe K. Kiely F, Goddard A, Sheer D, Srivastava A, Antony AC. 
Fioretos T. Mitelman F and Solomon E. (1994) Molecular analysis of simple variant 
translocations in acute prpmyelocytic leukemia. Genes Chromosomes Cancer 9: 234-243. 

Goddard AD and Solomon E. (1993) Genetics of Cancer. Adv. Hum. Genet. 21: 321-376. 

Borrow J. Goddard AD, Gibbons B. Katz F, Swirsky D. Ftoretos T. Dube I, Winfield DA. 
Kingston J. Hagemeijer A. Rees JKH. Lister AT and Solomon E. (1992) Diagnosis of acute 
promyeloc^tic leukemia by RT-PCR: Detection of PML-RARA and RARA-PML fusion 
transcripts. Br. J. Haemato/. 82: 529-540. 

Goddard AD, Bon-ow J and Solomon E. (1992) A previously uncharacterized gene. PML. is 
fused to the retinoic acid receptor alpha gene in acute promyelocytic leukemia. Leukemia 6 
Suppl3: 117S-119S. 

Zhu X, Dunn JM, Goddard AD, Squire JA. Becker A, Phillips RA and GaHie BL. (1992) 
Mechanisms of loss of heterozygosity in retinoblastoma. Cytogenet. Ceil. Genet. 59: 248-252. 

Foulkes W, Goddard A. and Patel K. (1 991 ) Retinoblastoma linked with Seascale [letter]. 
Bniffs/) Med J. 302: 409. 

Goddard AD, Borrow J, Freemont PS and Solomon E. (1991) Characterization of a novel zinc 
finger gene disrupted by the t(15;17) in acute promyelocytic leukemia. Science 254: 1371- 
1374. 

Solomon E, Borrow J and Goddard AD. (1991) Chromosomal aberrations In cancer. Science 
254: 1153-1160. 

Pajunen L, Jones TA, Goddard A, Sheer D. Solomon E, Pihiajaniemi T and Kivirikko Kl. 
(1991) Regional assignment of the human gene coding for a multifunctional peptide (P4HB) 
acting as the p-subunit of prolyl-4-hydroxylase and the enzyme protein disulfide isomerase to 
17q25. Cyfogenef. Ce//. Genet 56: 165-168. 

Bon-ow J. Black DM. Goddard AD. Yagle MK. Frischauf A.-M and Solomon E. (1 991 ) 
Construction and regional localization of a A/ofl linking library from human chromosome 17q. 
Genomics 10: 477-480. 

Bon-pw J, Goddard AD, Sheer D and Solomon E. (1 990) Molecular analysis of acute 
promyelocytic leukemia breakpoint cluster region on chromosome 17. Science 249: 1577- 
1580. 
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Myers JC. Jones TA, Pohjolainen E-R. Kadri AS. Goddard AD, Sheer D, Solomon E and 
Pihlajaniemi T. (1990) Molecular cloning Of 5(IV) collagen and assignment of the gene to the 
region of the region of the X-chromosome containing the Alport Syndrome locus. Am. J. Hum. 
<3enef. 46: 1024-1033. 

Gallie BL, Squire JA, Goddard A, Dunn JM, Canton M, Hinton D. Zhu X and Phillips RA. 
(1990) Mechanisms of oncogenesis in retinoblastoma. Lab. Invest. €2: 394-408. 

Goddard AD, Phillips RA, Greger V, Passarge E, Hopping W. Gallie BL and Horsthemke B. 
(1990) Use of the RBI cDNA as a diagnostic probe in retinoblastoma families. Clinical 
GeneticsZJ: 117-126. 

Zhu XP. Dunn JM, Phillips RA, Goddard AD, Paton KE. Becker A and Gallie BL (1989) 
G^mlline, but not somjatic. mutations of the RBI gene prefereritially involve the paternal 
allele. A/afuAB 340: .312-314. 

Gallie BL, Dunn JM, Goddard A. Becker A and Phillips RA. (1988) Identification of mutations 
in the putative retinoblastoma gene. In Molecular Bioloov of The Eve: Genes. Vision and 
Ocular Disease . UCLA Symposia on Molecular and Cellular Biology, New Series, Volume 88. 
J. Piatigbrsky. T. Shinohara and P.S. Zelenka, Eds. Alan R. Liss, Inc., New York. 1988, pp. 
427-436. 

Goddard AD, Balakier H, Canton M. Dunn J, Squire J. Reyes E, Becker A, Phillips RA and 
Gallie BL. (1988) Infrequent genomic rean-angement and normal expression of the putative 
RBI gene in retinoblastoma tumors. A//0/. Ce//. 8/o/. 8: 2082-2088. 

Squire J, Dunn J, Goddard A. Hoffman T. Musarella M, Willard HF, Becker AJ. Gallie BL and 
Phillips RA. (1986) Cloning of the esterase D gene: A polymorphic gene probe closely linked 
to the retinoblastoma locus on chromosome 13. Proc. Natl. Acad. Sci. USA 83: 6573-6577. 

Squire J, Goddard AD, Canton M, Becker A, Phillips RA and Gallie BL (1986) Tumour 
induction by the retinoblastoma mutation is independent of U-myc expression. Nature 232: 
555-557. 

Goddard AD, Heddle JA, Gallie BL and Phillips RA. (1985) Radiation sensitivity of fibroblasts 
of bilateral retinoblastoma patients as determined by rriicronucleus induction /n vflra. Mutation 
Researc/Jl52:31r38. 
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We have enhanced the polymerase chain 
f eviction (PCSR) such that specific DNA 
sequences can be detet^ wifhtaiit open- 
ii^ the reaction tnbe. This enbancemeni 
feqnires die addition of edu^nm bjrOnxide 
(EtBr) to a PCR« Since fluorescence of 
EtBr increases in the presence of double* 
stranded (dif) DNA an joicrease in finoi^- 
cence in sudx a PGR indiicates a positive 
amplification, which can be easily mioni* 
tored externally. In fact, ainplificatiion can 
be continuously nx^nitored In order to 
follow its progress, tile ability to simulta- 
fieously amplifj^ specific DNA sequences 
and detect the product of the amiOificalion 
both simplifies and improves PCR and 
ntay fecilitaie its autonoation and more 
widespread use in the clinic or in other 
situatMUS requiriilg hi|^ sample i^bkrough- 
put 

Although the potential Ij!Cik:6.u; of PCR* to cUu- 
tcai dsagnoitKs arc wcsll kiiowii*'*, k h still not 
widely used iiit this setting, even though U is 
fottr yearft eiuoo thcrroomW^ DNA p^jrfni^r* 
SOUS** mfldc PCR pxacticsd. Some of the r^bns for itfi $low. 
sQce^MAtiec are high 0Q9t, tadc of automation of pre-r and 
posC-PCR procc39ing steps, and false positive reswts. from 
carryover-contamination. The fim two p(»nt» arc rdated 
in th^kt Ubot is the largest cotttributor to cost ait the present 
stage of PCR dcvelopmeat. Most tturtciit assays rcquittt 
ooinc form of **downstre»ui" pttjcessing once tbennocy- 
ding ifi done in order determine whether the target 
DNA sequence was present and ha$ amplfficd, Thcie 
include DNA h^ridization**, gel €le<to)pho»<*»* 
without use of TOtrktiort digeadon^**, HrfXr, or caj^iary 
ctectTophorcsU*^* These methods ate labor-intense, have, 
low ihroa|^put« and arc difficult to aotomatc. The third 
p<^ is abo closdy rdated to downstream processing. 
The haudl'mg of the PCR prodvct in these downstream 
processes increases the chances that kmpiyied DNA will 
spxead through the t^TP'^g' hh» fesuluxig in • a .risk of 



'•carryover" £s^be positives in subsequent testing". 

These dowixstrcam processing steps would be elimi- 
nated if spedficanipljfication and detection of amph&cd 
ONA took i^ace simuftancon^ly within an unopened re- 
action vessel Assays m wbkh such dilFerent procc$se$ take 
place without. the need <o separate reaction components 
have been temed •^homogeneous". No truly hbmogc-. 
t^tts assay has been dcinonstratiMl to dac^^ 
progress towards this end has been reponcd. C3i^]cabr et 
al'^ developed a PC«l |H<H3iia detection schensc using 
fluorescent pnmers that requited in a fluorescent FCR 
product Alldc-fipedfic primers, cwh with different Bikh 
tescsent tags, were nsed to iodiGaOc the gCfiotype of the 
DNA. However^ the iminnnnporaced primers must still be 
s%moved in a downstream praocss in order to vfeuaJizc the 
result Recently, HoBaiid, et al,*', developed an asw^ 
wiildi the endojgenous 5' exdoudease assay erf r<flf DNA 
potymeiafie was exploited to deave a labeled ^jBgonudeo- 
tide probe. The probe would only deave if PCR aro|*n- 
cation had pwhiced its comj^mentary scqucndt. In 
wler to detect the deavage pwiducts, however, a sub^e- 
qnent process is again needed. 

We have developed a tttily homogeneous assay for PCR 
and PGR produa detecdbn based upon tbc gready ut- 
creased fluorescence that ethidium btoinSde and other 
DNA binding dyes exhibit whcaj they are bound to. ds- 
DNA^^*'. As oudino! In Figure 1, a proiotypic PCR 
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PC^prcKiucC TEtccOiSmOocnt^ ofa PGR coDUrtnh^EvBr thai ans 
OuoresOTf areRswd— £tBr itseiC Et»r bound tocadicr ssDNAot 
daDN A. Tbere Is « Ui:^ Quorcscencc cnbanomicxit whcD EtBr |s 
bound to t>NA and Ksndm> is greatly eahanccd when DNA js 
doufalc-strzndcd, Ato sq$dait <njf..cvcics of PCR* the .net 
ttiarca»e in d.'ipNA resuks in addidonal £tPr buMfing, and » net 
iocrcase in total fluorcsoenat: 
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tWWdi It Gd dccm>phof«^ cyf PCtl atn ptiScadon prodMCts cf chc 
humaor tiudear gene, HLA DQtt, made m the pmcncc of 
incrcssidg amounts of EtBr (up to 6 fL^toQ. T|ic presence of 
&:tBr b» 110 oMom cffea <m ckc yldM or spcd^^ 
cackm. 





R0OIC > (A) nooiesGcnc^ mcasurcnienu 6t>oi PCRs that coutain 
OS p^od ElBr ackd 'th^ 9Xt specific fen- V-<])^tno$oaK repeat 
5«<w«tiix:*, Five nn^aite PCRi were begun ooptsliiinff each oi tbc 
DNA« spedfied. A« racb mdicma cyde, one cf the five repGcatc 
l^CRs for eodi DNA uras fcjiiov^ from thcrmocTding and Hs 
fluoreaGencc measured. Units of fluo^vAcciKe »re aibitrair. (B) 
Uy pbeii^gTaphy of PCRtubes (03 ml Eppcodorf ^tyle, poi)^pit>- 
pjdene micro-cemri^iro tiAes) cont»mrag reacttovis, those scat*, 
tng £rom 2 ngmaleDNA and contn)! rcamm wkhottt any DHA^ 
from (A). 
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bcgiiu ^dk prixaers that are single-stranded DNA (ss' 
DNA), dNTFs, and DNA polymerase^ An amount of 
dsDNA containing the target sequence (target DN^A) h 
also typically present. Thh amouat can vary, depending 
on the applicatioa, from sii^le<!elJ amount) of DN A*^ Co 
micro^;raoi8 pa* FCR^^ If EtBr b present^ the reagenu 
that wiU fluoresce, in order of increasing fluorescence* are 
EtBr Hsdfi and EtBr bound to the singk-fitraxidcd 
DNA ptimen ^nd ta the doiiblo^txriiided target DNA (bf 
its mtercahdon between the stacked 1»$e9 of the DNA 
doublc<hcfi;K)< After the first denatutaticm cyde, tar^t 
DNA wiU be large:ly singlc-Wanded, After a PGR 13 
oompleted* the most stgnihcant change is the increase in 
the amount of dsDNA (the PGR produa itself) of up to 
several mierogr^uns. Fonneriy free EtBr <s bound to the 
additional dsDNA« resulting in an increase in fluorps- 
ccnce* There is also some decrease in the amount of 
ssDNA primer^ but becau^ tbc landing of EtBr to S6DN A 
is much less than to dsDNA, the eflect of tim change on 
the total Ruorc^cciicc oi the samite is &ma]L The flootx^- 
cence increase can be measured by directing cxcEtatK»i 
iltumiftad^m thnpugh the walls of the amplificadm vessel 
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before ^nd after* or even a)niji]i«ously during, thermocy. 
ding. ^ 



RESULTS 

PGR in die presence of EtBr. lo order to assei»s the 
affect of EtBr Iq FOR, ampHficadons of the fauirian Hl^ 
DQa giHw*' vrere performed with the dye preseiu ac 
concentration$ from 0.06 to 8.0 p-g^ml (a typical concen- 
tration of ErBr u.<^ed tn staining of nucleic aods ibllowiog 
gel electrophoresis is 0.5 p^wf). As shown in Rgure 2/g€i 
electrophoresis ix^veated little or no diiBEcrencc in the yield 
or qualiiy of tbc ampiiCication product i4>e(b^r EtBr was 
absent or present at any of these oonoentrfliion$, indicat- 
ing diat EtBr does not inhibit fCSL 

Detecficm of faqmsia Y-dtaxmMOmon spedlic 
^DGoces- Sequence^pecilK, fluorescence eohanoement of 
EtBr as a result of PGR Yias demonstrti^Utd in a scries of 
amplificadons contajmug 0,5 lig/ml EtBr and pHmer$ 
fipedfic to repeat DNA sequcticcs (bnod on the human 
Y-chromo^omc**. These fCRs initial^ cxnxtamed either 
€0 og male, 60 femaic. 2 ng male Ittiraan or no DNA. 
Five replicate PCRs were begun for 6acb DNA« After 9» 
17, S;] , 24 and 29 cydes of thenuocyding, a FOR cadi 
DNA was removed from the thermocydier* anid its fluo- 
resoenoe measured in a spectrofiuoromeler and plotted 
vs. ampiificadon cyde number (Fig. 3 A). The shape of tUs 
curve rcSccts the &ct that by me time an increase in 
fluorescence can ha dct^mat^^ tlic izicreasc in DNA i$ 
beconung linear and not exponential wich cyd^ number; 
As shown, the Auorrsccncc i<K:rca$cd about three-fold 
over the iMiCkgroand ftuotescfekifce for die PCRs Contain- 
log htmian male DNA, but did not significantly increase 
Ujv negative control FCE^ which cont^ed either no 
DNA or human femakr DNA. The more male DNA 
present to begin with— 60 n| versus ? ng-nive fewer 
cydes were needed tto give a. dctectaWe increase in fluo- 
rescence. Od electropnoreatis oo the products of these 
amplifications showed that DNA fragments of the ex- 
pe<^ skc wccc made in the mak DNA containing 
reactions and that Uuh DNA symbe«a$ lo^ piaoe in the 
control sam^es. 

In addition^ the iacrcw in ftuomccncc visualized 
by simply faying the compfeted, unopened PCRs QD a UV 
tramilhiminator and photogmpbing them through a red' 
filter. This k shown In figure 3B ibr the reiatcdons thai 
began with 2 ng male DNA and those with no DNA* 

De(«etioit oT ^>ecs6c allck^ of ttyc htiman 0-^obiD 
gene. In order to demonstrate that this a^prdadi has 
adequate spedfidty to allow genetic screexung* a d^ccdoq 
of the ^tckfc-cdl fmicmia mulaitMa was peifbrmcd F^^ure 
4 shov^ the (H)Ore$oenGc from csomplctcd amp^catio^ 

oantaiioiag EtBr (0.& y^Qh»id) At Ae!ttexi6d by photogrsiphy 

of the reaction tubes on a UV transiliaminator. These 
reactions were performed Hang prixnc;ni spedftc for ci* 
thcr the wild*irae or sickle-oell mutattoa of the human 
p-^obin gene*^ The spocilRdty for each aUdc is unpartcd 
by pladng the sickie-muiation site at the terminal 3' 
nuc^Mrddc of one primer. By using an appropriate primer 
annealing temperature^ ptirn^ octeaslo^t^-'and thus aixb- 
pti5c9tion--can take place only if the 5' nucleotide of t^ 
primer is coi^plemcntaTy to the p^JkMd aBdc prc^cnt^*^. 

£adi fiair Of ^mpBfications shown in Figure 4 consists of 
a reacttoh wiOi either tbc wiMnypc allcte $pcdfic Gcft 
tiibc) or sickk>aUefe specific (rigbt tube) pnmers. Three 
di£fepent DNA^ were typed: DNA fnotn a homozygous. 
wtW-typc 0-^obin individual (AA); from a heterozygous 
sickle pMgipbin individu^ (AS): and from a homozygous 
sickle p-glc^o ixidi^ual (SS). Eadj Dt^A (50 ng gcn^omic 
DNA to Stan e^ PGR) was qfialyzcd in triplicate (3 pain 
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reactions each). The DNA .type vas reflected in tbc ' 
^^^ij^ Auorescence intcnfiicies in eairh pair of completed 
^jyipl^oitionfi. There was a significant increase in fluorea* 
^j^noc otily where a ^-globixi allele DNA matdied the 
ntitocr set. WhcD measured • on a spcctroflnorpmctcr 
(fjassi shown), this 6.aoxx»ccn.cc was about three tixncs 
ji^ present in a FCR where both p-gfobm alleles were 
^inatcbed tp the primer set. Cel cicctrophoxccw (oot 
^owfi) established that chts increase in fitiotescenee wa$ 
(jue to Che synthe^ of nearly a mkrogtam oC a DNA 
^9gaicni o£ the expected size for ^lobin. There was 
fitdc syntHestt of dsDNA in reactiotis in which the aflete- 
tfiedfic immer was mismatched to both alleles. 

Gootittvova iiMHsxtming of a PCR> Usic^ a 6bcr optk 
dcvkerii piMKibte to direct exidtation llluminaM>n from 
ji ^pectrolluorometer to a PGR undcrgoitig thcnnocydia^ 
and to retwrn its fluorescence w the Rpearoftuoiwweier. 
«riic fluorescence readout of sudx an arran^exnenty di' 
fccttd At an £tB r^ntaintng amplificadon ot y^romo- 
some spcdGc sequetKCS from ?5 jij of human male DNA» 
U shown in Figure S. The readout (torn a control K^R 
no target DNA is also shown. Hiirty cydes of PCR 
yem monitored for each. 

The ftuorcsccrtcc trace as a function of time dearly 
s^Mws the eflfcct of die thertnocyding. Fluorescence inten- 
tly rises ajod fiadls invcpdy with tccspcraturc The fluo- 
rescence intensity is minimum at the denaturaiion tem- 
perature (^*C) and maxinium at Acanncalin^exlemian 
tcmpcratwix: (50X). In the negative-control rCR^ these 
tfuorcscenoe maxima and minima do not change sigaiS- 
csinUy over the thirty AcjrraOcycJes, indicating ihat there is 
IHile dsDNA t9yntbe«ds without the appropriate target 
DNA, and there is little if any W«»dung of EtBr durmg 
(he continuous sllvminanon of the samj^. 

In the FCR containing male DNA, the flqorcscencc 
naxim^ at the anneafing/eaccension temperature begin to 
iacreane at about 4000 seconds of thermocyding, and 
Gontintic to incx^se with dme, indicating that dsDNA is 
being produced at a detectable leveL Note that the fluo- 
rescence minima at the denaturatioQ tctnperanire; do not 
signi£candy increase^ presumably because at this temper- 
atuxx: there is no dsDN A for EtBr to bind- Thus the ccfurse 
of the amplification is followed by traddng the fluorcs-. 
cenoe increase at the annealinjf temperature. Analyws of 
the products of the$e two amphfications by gd clccbnopho- 
rows showed a DNA fragment of the expected wze for the 
male DNA contaming sample and no det^ctaWc DNA 
synthed5% for the control sample. . 

DISCUSSION 

Downstres^ processes such as hybridi7.axjon to a se- 
q«eo£e-&pedfie probe can enhance tite specifidty of DNA 
deietJiiMii by PGR. Thic chmioation of Uicac proccssca 
means that' the spcdfidty of this homogeneous assay 
depends solely on that of fCR. In the case of sidJe-oeli 
disease, wc have shown that PGR alone has $uflkient DNA 
sequence apcdfidty to permit genetic screening, Using 
appropriate amplification conditions, there is litUe non^ 
9pcdfic production of dsDNA in the- abococe of the 
appropriate tai^ allde. 

The spedfictiy required to detea patliogens can be 
roore or less than that rcijuired* to do genetic scrccnm^ 
depending on the number of pathogens in the sample and 
the amount of other DNA that muet be taVen with the 
sample A dlfikult target is HIV, Which requires detection 
ci a vwal genome that can be at the level of a few eo|^ 
per thotL<ands of host cells*. Compared with genedc 
^creenmg, which is performed on cdls 5x>ntamiDg at least 
one copy of the target ioquettcc HIV idetectk^n requires 
both more spetifidty atid the input of more lota< 
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.^^^ „UVphot!Ogr*p!rt^of FCRtulK»ODiiitaiirafi<am^ 
^fi? EtBr that w spea^ to wildtn>e (A) m sidJe CS> alldat of 
the fiWin ^••gM'm gene. The IdBt of 

dIkl«-»podBc pHmers to the %nkl-typc afldes. the right tube 
primers to the «*Ie aflde- The phmc^ph wa* tai«i after SO 
<ydcs of PCK >wi the input DNAs anidl the alkJes ihnr amtadd 
m mdkaited Vi^ tog of DNA was used to bean FCK Typtja;^ 
was done in tripiicBfic (3 paits of fClU) for cac$ inpui DNA: 
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nCVKS C(mhnuous^ rcd<time monitoring of 
wa* used to cany cxdtation ftglit tn a ttiR *m pogrcss and also 
emhted Ught Ixtti to a flooromctta- (sec Exp^^ 
AmpHfiaiBoainlja^iinman nido-DNA apcdfic pnraexs m a TQR 
st^rnxkg with to ng of human nuuc DNA \*ofh or jn a OMtnn 
POTrnthout PNA (bPltom), were monhopcd. Thirty cyd« of 
rCR wett; fdl&itfcd for cadi. The temperature cytfcd between 
Sit; (denaturadon) and 50^ (annealing and eJOtcnwOTi). Hoic in 
the rode DNA PCK^ .die cyde (droc^ d<pie«lcnt increuc bi 
fiuonsGcncc U amttsaBngtextcnwrn tcmpetature. 
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DNA— up to microgram amouncs^a order to have suf- 
Gdcnt numbers of target sequences^ This large amount of 
Atartitig DMA m an atnpU(i<:atSoo stg&i6c^Tttly increases 
the bacKgTO«rt<i fluoKScence over wbidi a»r ^^Jdmona! 
flaor^soenofr pttxluced by 7CR rdu$t hc dcttt:te<i. An 
additional compHiration that occurs with targets bw 
copy-nimiber ts the fonnadon of die ^^pdmer-dimer" 
amfacu This is ihc rcsuJt of the cxten^ni of one primer 
ii»ng the other pnmcr as a template. Although this occurs 
infrequently^ Once it occuri the extension product is a 
substrate for PGR amp]ification« and can compete whh 
true PGR targeu if those targets are rare. The primisr* 
climer produa i$ of course dsDNA and th«.s is a potentbl 
source of false signal in tMs homoffCiie<>u5 ai^y. 

To increase FCR spedfidty ana reduce the cBTect of 
primer-dimcr antpUfKatioa, we are inviesdgating a num^ 
bcT of apjprnachcs, indudtng the use of ncstedrprimer 
ampl^kazions dtat take plaoc in a ^ngic tubc^, and the 
^t-start*', in which noiispedfic at^pU^^Mott U reduced 
by raising the temperature of the reaction before DNA 
synthesis bcgins^^. Prdhninary resuhs using these ap« 
pnjaches $ug]g[e$t thatT>Timer-dinxCT is cflFcctivcIy reduced 
and it is possible to oelect the imrrtfase in EtBr fluores- 
cence in a PGR insdgaced by a single HIV genome in a 
backgroutid of 10* cxdU. With largct nulnbet^ of ectts, the 
back^xnxnd fluorescence oonoilTuted by genomic DNA 
beo^be^ probleniatic To.reduoc tins badcgrwnd, it may 
be possible to use sequence-spedlic DNA4>inding dyes 
that can be made to^rcferentiaUy l»nd PGR product over 
genomic DNA by mc^KfJOrttii^g the dye-b«iding DNA 
sequence into the FCR product ardugh a ^'add-on" to . 
the oligontideoddc primer*''. 

We have shown that the detet^n of fluore$cet>c< 
generated by an EtBr-contaihing;^ PGR ts straightforv^ard, 
both once PCR is completed and continuously during 
thermocydlng. The ease widi i^ich automation of spe* 
cific DNA detection can hc accomplished is the most 
ptotobing aspect of this -assay. The fluorescence axialysis 
of completed PCR* is alreadvpossiblc with cxlstii^ig instru- 
znentation in 96-weIl fonnar^. In this format^ the Buores^ 
ocncc in each PGR can be ^vandtated before, after, and 
even at sdcded points during therniocydii^g by moving 
die rack of PCRs to a Se-imcroivcJI pbte fluorescence 
reader^. 

The instrumentation necessary to continuously monhor 
multiple PCRs simultaneously is also simple in principle. 
A direct cxtcn^ti of the apparatus used here is to have 
tnuldple Bberopdcs transmit the c^sdtatioa hght and 6u- 
oresoent emissiEOEis to and from mutdple PCRs, Tlie ability 
Co monitor muIdpJe PCRs continuously may allow quao- 
titation of target DNA copy number. Egure S sho«r& that 
the lai^er the azhount of starting carg^ DNA, the sooner 
during PGR a fliion^scencc incrca^ is detected. Prelimi- 
nary experiments <Higiichi and Dot&nger, manuscript in 
preparation) with continuous monitoring have shown a 
Scnsidv^ to two-fold differences in initial target DNA 
concentration. 

Conversdiy« if the nutnber of target molecules is 
known — 11 can be in genetic screening— rcontinuous 
motutoring may provide a means of detecting false posi- 
tive and false negative re$u|t$. With a known number of 
target molectdes. a true poddve would eichSbit detectable 
fluorescence by a predictable number of cydes of PGR- 
Increases In Huorcseence dete<:ted before or after that 
cycle would indicate potendal artiiBacts* False ne;gpdvc 
residts due to, for eacample.. inhibition of DNA potyiner- 
ase« may be detected by indud^g vnUm cadi PCR an 
inefficiendy ampHfyitig marker, Tliis marker results in a 
fluorei;GenGe Incxease only a&cr a large number of Cy- 
cles— many more than arc ncixssacy to dcom a true 



positive. If a sample fails to ha\'c a fluorescence incrca^ 
after this many cycles, ioha?ition may be suspected. Since, 
in this assay, conclusions are drafyit based on the presence 
or abscooe of fluore^certcc signal alone, sudi controls tnay 
be important. In any event, before any test based on this 
principle is ready for the dinic^ an assessment of it^ fake 
posidve/fabe negadve rates will need to be obtained Xidng 
a large number of known samfi^cs. 

Ill Nummary, the indusion isi PCR of^dyes whose fluo- 
rescence is enhanced upon tnnding dsDNA makes it 
possibk to detca specific DNA amplificarion from ouislde 
the PGR tube. In the future, instruments based upon this 
principle may fadHtate the more widespread use of PGR 
in applications that demand die h%h throughput of 
saolpkas* 

EXPEIRMENT AL PROTOCOL 

HiuBum HLA-DOn 4MkLpliikaUtotts ctrntaiiuag £t8r, 

PCRs were set ao iniOO }U voliufDes eonolning 1 0 mM Tris-KQ, 
pH &S: 50 mM KQ: 4 mM MgOz: t3 units of taa DNA 
polyiuct;ise (Ferkj«t<>£}mcr CCctu, Norwatk, CT); 20 pniolc cscli 
of btiuun HlA-DQa ' gene spcdfic oligonucleodde tnimcrs ; 
(iHt& and CH27^ aod ^prooaxnatelf W copies of ViQci FCft 
product diluted fTt>m a ptedous fc»ctw. EmiiKuin bromide 
lElBr; Stftn^O w^ used H Cbe conceiUiattonE indtoited in Fi^rc 
2. ThenDocyding proccedod for SO evdes in a model 4^ 
thermocyder {Perkm-EJ«ef Cetn^ hJonratk, CT) using a "stcp- 
cydc" prcenun of for 2 nun^ dcnaturatxcin and 6<rG for 90 
scc flniK^uing and for 30 kg. e&tensCo». 

Y-chcomosomc vpcciac P<^ PCRs (100 |d tdta! r^<iiort 
volume) comaitiing OJ* Hfi^ml £t3r were pix^p^ied as described 
for HLA-DQo(, except widi different primcA and target DN As. 
These PCRs cont:»incd 1 5 pmolc cadi male DN A-spcctbc prHnet? 
YI.) and Vl2^, and cidicr 60 ng male, 60 t)g£eni9!e, 2 male, 
or no human DNA, Thennocydittg was WH; Tot 1 min. and 60SC 
for 1 tnin uang a ''&ep<ydA^ ptM^taiU^ The number of cydes for 
a sampk were as indicated in Figure FhiQn:sGcncc measure^ 
vnent K described below. 

Alleie^4pcGifi^ hora^ p-globiii ^ent J?CR. AmpUficauons ef 
100 fJ v^aflrae' using 05 jifiAnl of MjiBt wcpc prepared a« 
descried ior HLA^DQk above except with diSeroU primers and 
target 0NAS. These PCRs cOtiUined eidier. pnmcr pair HGPi/ 
Hipl4A <wBdHype globin speeiae primcoi) or MOmlplHlS (akk- 
Ic-giobin spconc pnmcre) ac 10 pmcde c^h^primcr per KR, 
Tlwse primers w^ developed by Wu ct aL". Three different 
iAtffnuNM were tued in separate amptiflcstions'^ ng eadi ni 
human DNA that was homorygpus for the sk^lc trait <$S), DrIA 
that was hcteroaTKOus for the skkle trtrtt <A$>» or DNA that yn^ 
hom<»r|;OUS for the W.C^obiK^ (AA). Thcrniocycfing w-«iis for SO 
€yc^at94X for I nun, >nd 65*CCor 1 min. udttg ,i "stcp-cycfe*' 
program. An anneadtng temperature of 55X b*dhcenf sMuvm vy 
Wn ei al^^ to provide alldc-spcdfic atm^totion. Completed 
PCRs were phmographcd tfatough a red filter <Wraliefr23A) 
after pladng the rcatSKm tabes aiop a model TM-36 tranflfflufifti- 
nator (UV-produetS Saii'Cabdcl, CAy. 

Fhtmeseoiee neasnremetiE. Ftaotesoet>ce measurcmeni^ were 
made on l»CRs contaSning EtBr m a Fliiorolog'2 0uoromc^ 
CSF£X« Edison. NJ)- Ejcdtattkin was at the 500 nro band with 
About 2 nm bandwidth wilh * GO 435 nni cut-off.filw jMi^ 
Grist, inc., Irvine. CA) to cxdudc sccond-otder ugfat. Eromeo 
y ght was detected at 570 om with a baitd width of about 7 nm^ An 
OO 530 urn cut-off jfiltcr Was used to remove the cxdtadon figJst 

ContitHMUA fInoreiGence monitoiitig of FCR. Condnuous 
motiitoring oiP a fCR in progress was accomj^isbed uamg die 
spoctrofluonHnetet and setdnga dcscrcbod above as weu as a 
liboopockcessory (Snuccat no. 1950) to both send excttajj^ 
light DO, and tecdve emitted light from, a PCR placed m a wcU <tf 
a modd *80 !her««>cydcr (PiSkinrElnier 
of the fiberoptic cable was a^ached with iwoute-cpo^q-*' v> t^ 
open top of a FCR tube (a 0,5 ml polypropytoic centiifi^c tube 
ynxh hs cap removed) ^effeawcW flcafing rt- The exposed "UH^^ 
ihft PCR tube and the end of the fiberoptic CjiWc were slijewed 
f\pom room and the rOO« lights were kept dimtned dytmg 
each nip- The monuorcd PGR was an amj^Iiftciddn <rt V-**^ 
mosdmG-spcdfk repeat scqoei^ees as dcscnbed above. 
usmgan anncaUag^ension temperauirc of 50^ The reaCtSOn 
was d6vercd widt mukend oil (2 drops) to prevcitt evapdra&on. 
Thenwcydinyand fluorcsociiGC Dcasuccmcot n-ere started si- 1 
multanoouslr . A timc4Msc sc*n wit6 a lO sdooad mt^iradoA' tunc I 
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iv-rin uted And chc <!mlls«ion 5%nal was ratiocd tn the excitation 
!ugt>«^ ^0 cotiirot for <ftiWfc66 ia ]i|ht-4oum: mdctunty. Osioi.wcre 
^iS^ccLcd oaiag the dna3O0Of, vemOQ 2.5 (SPEX) ibta systxrni. 

Wc fhAftk Bob Joncft for hdp with the ^jectrofluormcmc 
ip(}iutff«in«n.U Mid-Hcalherbdl Fon^ for cdhing this manuscripL 
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IMMUNO BIOLOGICAL LABOI^ATORIES 



SCD-14EUSA 

Trauma, Shock and Sepsis 




The 00-14 moiecule is' expressed on the surface of 
monocytes and some macrophages. Membrane- 
bound 00-14 is a receptor for lipopolysaccharide 
(IPS) complexed to LPS-Bindirig-Protein (IBP). Ihe 
concenirailon of Its ^olubie form is aftered under 
certain pathdogicai conditions. There, is evidence for 
an Important mte of sC0-14.vyith pofytrauma, sepsis, 
burnings and inflamrriations, 
Ouring septic condidons and acute infections ii seem$ 
tobeaprogriosticmaff^erandistheieibreof vsAiein 
monitoring these patients. 



IBL offers an EUSA for quantitative determination of 
soIuW© CD-14 in human serum, -ptesma, cell-cufture 
supernatants and o9idr biological fluids. 

12x8 determinations 
(microSter strips), 
precoated with a specific 
monoclonal antibody. 
2x1 hourinout>ation. 
standard range: 3-96 ng/ml 
detection limit: 1 ng/ml 
CV: intra- arid interassay < 8% 



for more infonnation caU or fax. 
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We bave enhanced the polymerase <:hain 
teaclioii (PGR) such that specific 0NA 
sequences can be detected widbtout opesi- 
ing the reaction tube. Hftb enhaii^einefit 
leqnires the mldition of etfaidinm bjtomide 
(EtBr) to a PCR. Since ^e fliiKirescrace of 
BtBr increases in the presence of double* 
stranded (d^) DNA an Increase in fin<M?es- 
cence in stidbk a PCR indicates a positive 
amptificationy which can be ^^VLy mmir 
toted externally* In £aict, amplification can 
be coutinuonsly nKi^nitored in order to 
follow its progiess. Hie ability to simulta- 
neously amplify specific DNA se^ences 
and detect the product of the amplificati<m 
both simplifi^ and improves PGR and 
may fsicilitate its automadon and more 
widespread use in tiie clinic or in other 
situations requiritig high sample through- 
put 

Although the potendai!?!cnc6.w of PCR* to.clip- 
kal <&gnosdc» wcHkuLOwn'-Mt h sufl net 
%^faiely used ijnt this setdng, even thourfiit is 
f<nl^ year* eitu^ thcra*Qi*!ibI«! P^^A P^b^^" 
asc5* m^idc PCR pwicticaL Some of the rfeasbns for H slow, 
accepiatice are high 0Q9t, \^ of autoniad<m of pre-? and 
post'PCR proocMing ftteps, and labc poritive results, from 
carryavcr-contamination. The &T5t two pomts »rc related 
in th^it Ubor is the largest cotttributor to cost ait the present 
stage of PCR development. Most Curteiit assays rcsuite 
80II3C form of "downstPcam" processing onw ^ettnocyw 
ding t£ done in order lo determine whether the taigct 
DJ^A sequcivx was present aad ha$ amplified. Theie 
indode DNA hybndwuon**, gel etecmjpbo^^^ 
^'itbout use of TOtrioion digestion^.**, HFtCr , or capSiary 
ekoTophorcsU'^. These method* ite )abor4nt€n9c, have, 
low (htnughput, and arc diiBBcuk to aoCOToatc. The third 
point is sSjo ckisdy related to downstream prooesslng. 
The bandliivg of the PCR produa in thcM dowrotrcam 
processes increases the cbaiices thai aixi|;dSHed DNA .yrift 
spread through the tjfpng hb, i^sulang ip a .risk of 



^cartyover" felse positives in subsequent cesdng^ 

llicsc downstream processing steps would be elimi- 
nated ff specific amplification and detection of amplified 
DNA took place simuhaneomly vtihin an unopened re- 
action vessel Assays in v;hich such dilFcrent processes take 
place without. the oecrf fofieparate reaction components 
have beet» temed •^homogeneotis'', l^o tndy Iwmogc-. 
tieoas PCR assay has been d<?nonstrated. to date, akhough 
progress towards this end has been rcjKntcd.- Chdiab, et 
aL*% developed a PCR produtt detectbn sch^iixc using 
fluorescent pximers that resulted in a fioorcscent PCR 
product Aileic-specific primers, csch with different fluo- 
resdent tags, wore used to indicate the genotype of the 
DNA. However, the oninciorporaced pruncrs ^nust stiU be 
removed in a downstream ptxxess in order to visuaHzc the 
Rwcndy. Hoflai^, et al.*^ devdopcd aa assay in 
iwfudi the endogenous 5' <;3w>0udease assay of r<2f DNA 
pdymerafie was exploited to cleave a labeled oJigonudeo- 
Ude probe, The probe %«wild <Kily cfcave if PCR ampHtx- 
cation had pioduced its ooroplemeatary sw^cnce-In 
order to decea the ckavage pt^uots, however, a subse- 
quent process is again needed, . 

We have developed a tttOy homogeneous assay for PGR 
and PGR prodbiei detecdbo based upon tbc gready lo- 
ereased fluorescence that etludium titoinide and other 
DNA binding dyes exhibit when they are bound to.ds- 
DNA^*^*^. As outened in Figure I, a piototypic PGR 



/ 



(op i0|l{:J»MHIUt^ 




1 Priodplc of sizmxltaQcoua aoopEficsdon and- detection Of 
PCR pTDducL cxniip6nciit£ of a f<lR <»^^ 
duorewnt arefiswd— ttBr .ttselfc EtBr bound toddtcrssDWA or 
dsDNA. Tbct« IS « Ivge fiuorcsccooc cnhanrrtncitf when EtBr is 
bottod to PNA and bmdinfr u greatly ct^nakl when DNA .Is 
ddable-strandcd, Ater suiudent (n>..^d£s of PGR* the .net 
tao^aoe in d^NA resuks in addhiaa^ EtBr t^juiifin^ and ^ net 
mcrcasus tn total fluaEcsocncc: 
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product 



MmtGd«locti^pIlQre^of K:&^plificat^ pro4v^ of chc 
huQKw, tcudcar gene. HIA DQtt, made in the preseacc of 
tncrc;»fjag amouats of £tBr (up tx> t WtdQ. T|lc presence of 
&tBr b» tio obvkw efiTect on the )4dU 
outoa. 




P 17 21 2S 29 <r«*" 

\ \ 17/ 
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(A) Ituoicscence i&casurcnients {rom FCRs tliAt omtain 
p^nd and ate spcdfic ftfr V^trotnosooae pra»cst 
MOtfetioc^. Five rgdicsce FGRs «rere begun conuliiin^ each ot the 
DNA«9pedM. A ieachindkntc^ cyd«, one of the five repGcatc 
FCfts for C9di DNA was foiio^ from thcrmocycSng and tt$ 
fluciFciKcsice oicasured. Units of fluormecnce Are aibitrziT. (&} 
UV |>lMU>gnit% of FGRtubet (D,5 Eppcndoif^cylc, ixHypro- 
pykiM mki^»<«ntri^TC:tul]C8> contawln^ reactions, those j(Ca.tt^ 
ing from $ ng male ONA auid oimtrol xvactiom without anf DNA^ 
from (A), 
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begins ivith priiaen that are sitigle-stniodcd DNA (ss- 
DNA), 4Ifm, njEMi DMA polymerase: An' amount of 
d5DNA containing ih^ target aequcnce (target DN^A) u 
also tyjMcally present. Thl$ aimovac can vaxy, dej 



on the appikadoo, front slngje^ceU axnotint$ of DNA^^ to 
micro^^rams per ?CR^^ If ElBr is present^ the reagciuii 
that fluoresce, in order of increasitte^ fln<»rcsGcnoe, are 
fvvc EiBr hscK axul £tBr boudd to the dng^fe^urandcd 
DNA ptlmeta and to the doutk^stranded tai^get DNA (bf 
its mtercaladon between the sfcadccd ba$c9 of the DNA 
dooblc^hcfix)* After the first denatutation q^ck^ target 
DNA wiU be largely $ii^-9tranded. After a VCR is 
comideted. die most slgQi&an£ change k die increase in 
the atnount of dsDNA (the PGR prepuce itself) of up .to 
sev^^l mieri%gran». Formedy free EtBr is bound to the 
additional dsDNA« rcralting in an mcrease in fiu^rcs- 
ccnce. There is also scmicm! decrease in the amtmnt of 
ssDNA pciiner« but b^cau^ the binding of E^r to S6DNA 
is tnudi lc«$ than to cisD^lA« die effect of tins on 
the toi^ fluorctccdcc c^che sample is smalL The Dvorcs* 
cencjc increase can be measured by dit^ecting cxdtatioa 
iliuminatian thro«lgh the mUs of the ampltlkaifOA vessel 
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bcfarc and after, or even coniinxcously during, chennocy. 
R£$UI.TS 

PCR in tiie presence of EtBr. lx» order to asseijs th^ 
afiect of EtBr In jPGR. ampMcatious of the human HLA 
DQa g^sne*' Vfcrc performed with the dye ptx^ent at 
concentrations from 0,06 to 8.0 p^ml (a typ^ omcea. 
tration of EtBr tiered tn staining of nuckk aads fofio^iing 
gel ekctroplioresis is 0*5 v^^m^). As shown in Figure 2, pi 
ekexro^hc^csis rcveaied little or tio dijSercncc in the ^ield 
or quaJuy of the amplification produa whether EtBr was 
al»enl or present at any of these conoentratkm^, indicat- 
ing that EtBr does not inhibit PCR. 

DefeetioR of faoman Y-dmrnkoscMao specilic 
^oeDces^ Sequence^pecific fluorescence enhanoemeat of 
£cEr a£ a result of PGR ^vas demoiueral6d in a $cnes of 
ampjificadons containing 0,5 v^ml EtBr and pttmers 
speofi^ to repeat DNA sequctKcs lonad on the huxnan 
Y-chromosomc*^. These PCRs initially eontatned either 
60 ng male* 60 female, 2 ng joalc human or no DNA. 
Five repficate PCRs were begun for each DNA* After 0, 
17, 21 , 24 and 29 cycles of thenuocyding, a FCE for each 
DNA Vp^as rcoxovcd from the thermocyder^ and it$ fluo- 
rescetice mea^urt^ in a spcctrofinorotneter and pioucd 
vs, amplificadon cyde number (Fig. 3A), The shape of tJfcds 
cikrve rcSccts the £act that by me time an increase in 
fluorescence can be detected^ the increase in DNA U 
becoming linear and not exponential with cyde number; 
As shown, the fluorescence iooeascd about three-fofci 
Over the l^K^gronnd iluotescence for the PCRs c^tain- 
iog human maic DNA, but did not signiftcandy increase 
for ncgadve control FCRa, which cnntsnned either no 
DNA or human female DNA. The more maJe DNA 
present to btgin with— 60 hg versus 2 ng— the fewer 
.cydeii were fwjeded to give a detectaWe increase in fluo- 
rescence. Od electroohorcsi* oo the products of these 
ampMcations showea that DNA fragments of the ex- 
pei^cd skc were made in the mak DNA oonitamuig 
reactions and that Kule DNA syattiesb took place in the 
control samples, 

in addiuon, the i23CFca9c in Buoresccnee yra& visualized 
by simply laying the completed, unopened PCRs on a UV 
tramilhiininator aad pftotographing them through a red 
filter. This is shown fn figure BB lor xh& reactions thai 
began with 2 ng male DNA and dbiose with no DNA- 

Deteetion of speetee aIl4;Je« of the httman 0-^obia 
gene. In order to demonstrate that this a^pr6adi has 
adequate spedfidty to allow genetic screening* a d<bccaion 
of the' ^kle-odl anemia mutadon was performtxiL F%ure 
4 shows the (^uoresoencc from t^plcccd ampli&caiio^ 

containing £tBr (0.S v^QftitiS) a« de!ttict6d by plhtotogfaphy 
of the reaction cubes on a UV transilltmiinator. Tliese 
reactions were performed uang prhncrs spedfic for ci- 
ther the wad-m>c or sidde-ceil mutadon of the human 
p^obin gene* . The *pecifldty for each, alklc is imparted 
by placing the sidje-mutadon site at the temunal V 
nucteoqde of one primer. By using an appropijate primer 
annealing temperachre, ptimet* otteu^MOi^n-'^d thus am- 
pli6catJon^H»n take place only if the nudeotide of the 
primer i$ coiuplcnicRtaTy to the ^h^fMn alldc prciicnt^'*^. 

Eadi j>air df ampBiicadons shown in Figure 4 consists of 
a reaction with either the wildHype aliete qpectfk (Icffe 
tube) or skkk-aUde spedEc {ri|^t tube) primers. Three 
d^erent DNAs were typed: DNA from a homozygous, 
vrtid-tTpc p-globin individual (AA); from a hetero^fgcni^ 
sickie p^i5:>bin indrTidu^ (AS); and from a homozygous 
sidde ^gM^o indmduai (SS). Each DNA (§0 ng gnomic 
DNA CO starleadi PGR) was analysed in tripHicate (3 paits 
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fractions each). The DNA type tv^s reflected iu tbc : 
^31^ Huot^scetice intcn&iues in each pair of tomf^^d 
^ml^^catioiu. There was a significant increase in fluores* 
otHy where a ^•^bin aQele DNA matched the 
l^oicr ^t. When mitasured oa a spcctrofltioFOinctcr 
Ugt;^ not shovrn), this Euorcsccnce was about three tioies 
ji^t present in a PGR where both ^dobin alleles were 
^itiAtched tQ the pHmer set. Gel cla5trt>ph<>re5)$ (ooi 
^wii) established that this increase in fluorescence ws 
doe tfi the synthesis of nearly a microgram of a DNA 
li^^cnt of the expected size for ^lobin. There was 
Ptdc syroHesifi of dsDNA in reactions in. which the afleie- 
tfiedfic primer was mismatched to both aikks. 

Cocotitmov^ movatovia^ of a FOR. Usioig a £ibcr optic 
derkeril po«fiible to direct excitation illumutadon from 
^ qiectrofluotomeler to a ?CR undei^goixig thcnnocydic^ 
and td return its fluorescence to the spectroftuowwefer. 
The fliiorc$C;cncc readout of such an anrai^ementf di' 
^ed nt an Etfir-containtng amplificadoa of Y'diromo- 
Home spcciCfcc scqwcocjcs frotyn 25 of hum^n male DNA, 
U shown in Figure 5. The readout from a control PCVi 
wih no target DNA is also shown. Thirty cycles of PCR 
monitored for each. 
The fhiOTCSomce trace ajs a function of time dearly 
shows tiie ef^Krt of the thennocyding. Fluorescence inten- 
aSxy nses^and.£db invcrsdy wiOsi teflflpcraturc. The fluo- 
teMcnce intendiy is minimum at the denaturation tem- 
peniture (d4**C) and maxiniuin at the anncatin^^ctemian 
temperature (5<rC). In the negative^ntrol FCR» these 
duorcscencc maxima and minuna do not diange sugai6- 
ojndy over the thirty tbeirmOcydefi; indiotlng that there is 
thtk dsDNA 9ynthe«is without the appropriate tsx^ct 
DNA, and there b little if any bleaching of EtBr dtitmg 
the continuous flluinination Of the sam^. 

in the PCR containing male DNA, tiw flqorcscencc 
maxima at the annealingfextension tcmpcxaturc bc;^ ici 
increase ai about 4000 seconds of thenmoCyding, and 
contiotie to increase with time, indicating that dsDNA is 
being produced at a detectable kveL Note that die fluo- 
ixsccnce minima at the denatiiratioa tetrtpctatore do not 
aignificandy hiereasc« prCfUmaMy because ai this temper- 
ature tbeix; is no dsDNA for EtBr to bind. Thws the course 
of the amplification b followed by tracking the lluorcs^. 
cence increase at the aojieafinj temperature. Analysis of 
Ihc products of dic$c two amphficadons hy gel ektlropho- 
fcMs sbow«d a DNA fragment of the cicpected size for the 
male DNA containing sample and no detectahle DNA 
symhedsi for the control sampWL 

DISCUSSION 

Downstream processes such as hybridization to a se- 
queoce<«pedfic probe can enhance die specificity of DNA 
dcfci-tivii by FCR, The cHzbiiMtKm of dicac proccssca 
tncans that' the spccifidiy of this homogeneous assay 
depends solely on chat of FClL In the case of ckUe-ccU 
dtfcase, wc b»vc shown that K5R alone has sufficient DNA 
sequence apcdfidly to pcnnit gencdc scteening< Using 
appropriate amplihcatioa conditions, diere is little non- 
specific producdon of dsDNA in due- ab9cn« of die 
(ipjproprialc target allele. 

The spedfidiy required to dctJcct pathogens can be 
more or less than that recruited' to do genedc scrccx^ng, 
depending on the number of pathogens in the sample and 
the amount of other DNA that must be taken with the 
sample A difikuk target b HIV, which requires detection 
of a vhril genome that can be at the level of a few co|^ 
per thotiitands of hose cells^ Compared widi genedc 
^creemng, which is performed on cells jcontaiiung at least 
One copy of (he tai^et ioqucartce* HIV idetecdon requir:cs 
both more spedfidty »i»d the input of more total 
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ly of PCR tubes GEmtsuimi^ la 
iisiag Et0r tfin are spcaltc to wild-type (A) or fic&k (S> alldcs of 
the &mum ^-^^^ gcaie. The IdBt each 1^ 
^Bde-tpGofic plimers to the wiid-typc afldks« di£ nghf labc 
primers to the wAte attcle- The phmo^ph trf^co after 30 
cydcs of FCR. aod the input DNAs axMl the alkl«s^^ comaiD 
are mdicsted. ¥i£fy ug of DNA was used to b^n FGSL Typmg 
was done in tiipiiaite <3 pMTS of FCfts) for «acji inpm DNA: 



20r>g of mate DNA 





r-r- : 1 = r r 

0 2000 4000 6000 8000 
time (sec) 

HCWSCcHiaauou^rcaUnne mowtormgofaPC^ 
was^Mcd to c?irry cwilaiiOtt fight to a t^R in progr^ auid akp 
emHted boi to a Bfiorometer (sew E^^p^ 
AnroSficaSbn-ilsingliaiiwn nwIe-DNA iapcanc pnoicn m a PpR 
st»&g with to ng <rf hwinaa niak DNA <t<»>, w 
PCR inthout PNA (hotUMtt), were, noonnorcd, Thiitv cjrd« of 
VC3Bi were foHowcd fiw each* The tempcratun! Cjr^cd between 
WC (denatMradotn) And 5(rC (anucalidg and exicnWOTi). Note in 
the male DNA PCE,.the cyde (drac> dcpinsSdcDt inoeiuc bi 
ihioracciice at thd amttsaBng/extensimi tcapetatitfe. 
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DN A---up to microgram amounC»-*Ha order to have suf- 
fidcnt n\uzibCT6 of target sequences, Tht$ Isurge amount of 
xt^iiig WA m an ampUSc^tioo stgni&c^nUy Increases 
the bacKgrourtd fluorescence over widt any additional 
fluorescence pttxluced by PGR must be dct/Me^. An 
additionai compHcation that occurs whh targets in tow 
copy'-number b the fonnation of the *^pnmer-^iinet^ 
artifacc This is the ccsidt of the cxtensioni of one primer 
using tibc other pritner 9S a template, Afthotigh this occurs 
infrequently^ Once H ocscairs the extension product is a 
substrate for PCk ampllfkation^ and can compete with 
true PGR targets if those targets are rare, Ihe prim&r- 
diiTier produa « of course d$DNA «nd thus is^ a potcntnti 
source of false slg;tial in this homogeneous a^y. 

To Increase PCR spedfidty ana reduce the efiTcct of 
prifneiMiimcr axilpUfkation* we are invesdgatlog a ntim> 
bcr of approaches, indudtng the use <rf' ncst^'-primer 
amplifications that take place in a ^ngie tubc^. and the 
^ot-sUrt*\ in which nonspedSc amplitki^n is reduc^ 
by ndsmg the temperature of the reaction before DNA 
synthesis begixts^^ Prdhninary results using these ap- 
proaches sugj^est tbat-pnmersiinieT b effectively reduced 
and it is possible to aetect the imrrcase in EtBr Huorcs- 
cencc in a PCR instigated by a single HIV genome in a 
backgroutid of lO' cdts. With larger ^umbers^ of c<^s, the 
background fluorescence comnbuted by genomic DNA 
becomes problematic. To reduce this backgrotiod. it may 
be possiUe to use sequenoe^pedftc DNA^tnnding dyes 
that can be made toprcferentiaJly bind PCR product over 
genomic DNA by mcorporating the dye-binding DNA 
sequence into the FCR product thrdugh a ^add-on" lo . 
the oligonudeotidc primer*'*. 

We have shown that the detection of fluoresccttcc 
generated by an EtBr-contaiEiing fCR is straightforward, 
both once PGR 19 completed and continuously during 
thermocycfing. The ease with which automation of spe- 
dfk DNA detection can be accomplbhcd b the most 
promising- aspect of this assay. The Huorescence analy^s 
of completed PCRs b alreadvposstblc with cidsdTi^ instru- 
mentation in 96-well format^. In tliis format^ the fluores^ 
ccnoc in each PCR can be ^uantitated before, after, and 
evcu at seiccted points dunng therniocyciitig by moving 
the rack of PCRs to a 96-fnierowcJ! plate fluorescence 
reader**. 

The instrumentation necessary to oonttnuousJy momtor 
muhiple PCRs simultaneously is also simple in prindpie. 
A direct eTctenston of the apparatus used here is to have 
tnukipte fiberopdcs transmit the c;bdtation light and flu- 
or^cent errussioos to and from multiple PCRs. Tlie ability 
Co monitor multiple PCRs continuously may allow qqaiH 
Illation of target DNA copy number. Figure S shows that 
the larger the amount of starting car^ DNA, the sooner 
dfiHn^^ PCR a (hjornscencc increase » detectcftl* Prdinii- 
nary experiments <Higxidii and DoUinger, manuscript in 
prcpar^D) with eontittuous monitoring have shown a 
sensitivity to two*fold diifcrcnccs in initial target DNA 
concentration. 

Conversd^f if the nutnber of target molecules b 
known — as it can be in genetic screening— Tcontinuous 
monitoring may provide a tncans of dctccong falgc posi- 
tive and false negative result^ With a known number of 
target molecules, a true positive wotdd exhibit detectaUe 
fluorescence by a predictable nunUKr of cydes of PCR. 
Increases in fluorescence -detected before or after that 
cycle would indicate potential artifacts* False negative 
refitiks due to» ft>r examine,. inhibidon of DNA pofyroeiv 
ase« may be deteoed by indnding within eadi PCR an 
ineflioendy ampRfyiiig marker. This marker rc&ulcs in a 
Auorescenoe increase only after a large nambcr of cy- 
cles—many more'dban an? nepcssary to dcoea .a (rue 
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positive. If a sample fails to have a fluorescence increase 
after thb many cycles, inhibition may be suspected. Since, 
in this assay, conclusions are drawn based on the presence 
or abseoo^ of fluorcjwnoc signal alone, such controls rnay 
be importaoL In any evezit before any test based on thu 
principle b ready for the dinic; an asses»Dent of fabe 
positive/false negative rates will need to be obtuned' using 
a large number of known samples. 

lit summary, thje induston in PCR of dyes whose fluo- 
rescence b enhanced upon tending dsDNA makes it 
possit^ to dctca spcd6c DNA ampUficanon from oucsldib 
the PCK tube. In the future, instrumeats based upon this 
prindpie may fadfitate the more widespread use of PCSL 
m applications that demand the h%h throughput of 
samples' 

EXPERIMENTAL mmXX>L 

Hwnun HE<A-I>(>k geme An^HBcatioiis co nraining Et8r. 
Pdb were set «p iniOO pX volumes coniainiag 10 mM Tris-liCl 
pH 8.3: SO mM RQ; 4 oAf MgOz: tJ^ vnlcs of Tor DNA 
polymerase (lysrkSivJPmcr Ccmi. Nonnilk« CT>; 20 wriote each 
of human HlA-DQa'gcnc spcdfic ol^onudeodde nimcrs 
iiHt6 and Cm?^ and appnwaniaidy 10* copies of DQoi PCH 
product diluted from a predons ftnc^, Eihidium bromide 
cE<&r; SignvO was tise4 at tbe conocntia^ons .Indicated io HgurQ 
2. Thermocyding proceeded for 20 CVtles in a model 
OierBMxydcr (Perki«>-Elmcr Ceta«^ fkirwalk, Ct) uang a •*«tcp- 
cydc" piwnun of 94*C for 1 min..dciiatUTation and 6(rG for sO 
sec anneafing and 72^ for 30 sec extension, 

Y-chfomtwomc specific PCEt* PCRs ()00 |il ucal reaction 
volume) containing Oit jaWinl £tJ^ wctc pcxrpmd as described 
for HlA'DQot, e^ceept wSx differed primccfe and target DNAs. 
These PCRs costUUncd 1 5 pmob eadi male DNA-spedfic prtmei^ 
YI. 1 and aiid cither 60 ng male, ^ neftmale, 2 maic. 

or 00 hofnan DNA Thennocydmg «v^94H;Tof 1 min. and GO^C 
for J mm using a "itcp-cyde* p t o gr am . The number ofcyctes for 
a sample were as inolcated in ftgnte 3. Fluorescence aieasuTe>- 
meiK IS d^^scribed bdow. 

Attcie-spccificy hoinan p-^MnH ^cvx^ PGR» AmpUfications of 
100 fii voSuinc using QJ5 jtg/ml of £cBr were pt^rcd a« 
described tor HLA-OQ^ above except with differcnEptttncrs and 
target I>NA&. These PCRs couudlned «aiber. pnmcr par HGrt/ 
fiP 14A <wiW-iype globin spec& primcis) or HCKWipl4S (skk- 
Ic-globin spcanc primers) at 10 pmole <^ priowr per fCK, 
Hiese ptimcrs were developed fiy W« ct aL*\ Tbc«6 dtflfereni 
atgei dNAa ttsc^ in separate aoipUflcatlon&--4r0 »g ca^i of 
human DNA that was homozy^s for the siddc trail <3S)* DMA 
that was heteroxycous for the Stdde traH {A$X or DNA that was 
hOQdOXrgous for tjftc W.t- ^obin (AA). Thermbcycfing lyas For SO 
cydesat94^ for 1 nUn. and SSH^for 1 min. u«n« a •*stcp-c3fcfe^ 
program. An anneaKng temperature oSSSX' b*dneen .diown tiy 
Wu ei al.^^ to provide; alldc-spedfie awpBiw^ ConiPlS^ 
FCRi vtere phmogranhcd thtuugh a ted fitter (Wratien 
after placzBg the reacDon lubes atop a roodd TM*36 tratwifluo^ 
nator <UV-prOductS Saii'Cabrid, CA>. 

Fhiomce^ tneaaoianetn. Fli^tesoencc rocasurcmen w were 
mad^ on PCRs coatainin|: EtBr in a iluorolog-2 flnororaC^ 
^FEXL £dison. NJ). Excitatkin was at the 500 nni band with 
ahour 2 nm baodwiddi with a GO 41^5 nm cutMjff.^lwjMeB^ 
Crista Inc^ Indne. CA) to exdudc socond-ofder light. Ewittea 
yght %vas <ktea.ed at 5^0 nm with a baudvodth of about 7 nm. An 
OG 530 urn cut-off Jfi*tcr Whs used to renwve the excitadoo fight 

CuntltHMUA flnorencence momtorittg of FCIU Continuous 
monitoring a PCR in prOgrcJS waa aasomplidjed u«tg mC 
fipoctrofiuonHnetcr and setdnga describoa above as weU as a 
fiboupticibccessory (SP£X cat no. 1950^ 10 both send owitatioa 
fight to, and tecdve emitted light fnmu a PCR ptaoed in a wcU <fl 
a modd ^ ihermocydcr (Pcrkin-EkMr Cetus). The prc^>c end 
of the fiberoptic cable was attached with "5 mtoute<pOTy'' 10 the 
open top of a PGR tube (a 0.5 ml p<>h«pr<myJcnc ccntrifiugc tube 
vnth its cap removed) dRectwely acafiog tL The eacposfidW «[ 
xjh% PCR mbc and the end of the fiberopdc cs^ wcrc i^iieldcd 
from room light and die looeo lights were kept dimmed during 
- each nm. The monkored FCft was an amt^enion of y<diTO' 
roosdnuyspcdlK: repeat scquetKes as described above, except 
u^g.an anncaling^^tension teronevauirc of 50^ The rcacpw* 
was cioveted with xniiiteva] oil (2 drops) to prcvcftt evap<^anon- 
Ihennocy c£ng- and- fluorcsocncc rocasuccmcnt verc staled si- 
multanecHisly. A tiinc^Mse scati wi(& a 10 sdcood integradoiy tnoc 
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SCD-14EUSA 

Trauma, Shock and Sepsis 




The CO-14 molecoie is expressed on the surface of 
monocytes and socne macrophages. Membrane- 
bound CD-14 is a receptor for I'lpopolysaccharide 
(LPS) comptexed to LPS-Bhding-Protein (LBP), The 
concentration of Its soluble form is aftered under 
certain pathologtodl conditions. There is esHdence for 
an imponant rote of sC0-14.with poVtrauim. sep^s. 
bufrtings and inflammafons. 
During septic corvdittons and acute infections it seems 
tobea prpgnostlc<narkerandlstheiieforeof vaSuein 
fnonftoHng these patients. 



IBL offers an EUSA for quantrtative determination of 

soluble CD-14 in human serum, -pfa^na, cell-culture 

supernatant and other biological fluids. 

Assay features: 12x8 determinations 
(mIcroSter strips), 
pfecoated with a specific 
monociond] antibody. 
2x1 hour incubation, 
staffidard range: 3-96 ng/ml 
detectior) Dm^ 1 ng/ml 
CV: mtra- arKi interassay < 8% 



for more information caU or fax. 
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Oligonucleotides with Fluorescent Dyes at 
Opposite Ends Provide a Quenched Probe 
System Useful for Detecting PCR Product 
and Nucleic Acid Hybridization 

Kenneth J, LIvak, Susan j.A, Flood, Jeffrey Marm^rOy William Gtusti, and Karin Deetz 
. I^crkln-Dmcr, Applied IMQ:(y»tCfn« Plvlslon, Foster City, California 94404 



: The S' tiUCleaM PCft mtmmy datttcts tl«« 

[ AcciimuUtlotiaf »|Mdfl«PCRpr6<iuca 
' by hybrMtxaUofi «nd ddiva^e of a 
double-labelecl fluoro^enfc prob« 
during the amiiliftciitlon reiicUon<. 
The probe Is an oUgonudeotlde with 
both a r«port«r fluorescent dye atMl a 
quencher dye attached. An Increase 
In reporter fluorescence intenfllty In-* 
dicAtei thet the probe has hybridized 
to the target product and bus 
been cleaved by the S'-^3' nude- 
olytic activity of Taf DNA pcfflyfii«ras«« 
In thb wbudys probes with the 
quencher dy« aliach^ to an Internal 
nudeotf de were compared wHh 
protseft wlUi the quencher dya at- 
teched to the B'nind nucleotide* In all 
cases, the reporter dye was attached 
to the 5' end. All intact probec 
thowed quenching of the reporter 
fluorescence. In general, probes with 
the quencher <|ye attached to the 3'- 
end nucleotide exhibited a larg«r sig. 
nal In the 5* nuclease PCR assay than 
the Intemaify labeled probes. It; Is 
proposed that the lenicr signal Is 
caused by Increased IIKellhood »r 
cleavage by ra^ DNA polymerase 
when the probe Is hybridized to a 
template strend during PCR, Probes 
wIlli the quencher dye attaclted to 
the 3 '-and nucleotide also <tthlblt«d 
an Increase In reporter RuorMceMca 
Inionslty when hybrldlaed to a cont- 
ptemeniary straiad. Titus, oligonucle- 
otides with repcHrier snd quendiar 
dyes attacked at' opposite ende can 
be used at- tiomo g eneaw hybrldUa- 



homogXfneoui: asixsy for . dotting 
lliiVHixiairiiiUtluii of specific |»<JR prod- 
uct iha{ u$€$ a double-labeled fluoro- 
genlc probe was dc'j&cribcil by Lee ct 
The assay vxploirs the 5' - » 3' nucle- 
olytic activity of Taq l)NA poly* 
iiitriaatj^^-''* mid Is diagramed tin l-lgure U 
'Vhc fliiorogeDic (iitjlju i'(»fiyi):t$ of «n gll- 
gonuflcnlidc. wUli <i reporter fluorescent 
dye., .>ui.1» «i « fiuormx'ltii attached To 
the 5' f-jid; and a quencher dye, such as a 
rhodaminc. Attached Internally, When 
thtf tluoresceln Iscxdted by in-^dmion, 
lis fluoresceni emission will be 
quc^nchcd It tlic xluMlaniine b clos« 
enough to be cscdicd throu^ th« pro- 
cess of fluDre5CCiicv vrivr^jy transler 
(M%r)/^ During PCR, lithe probe is hy. 
bridJwid to a tcmplalti ^liaiid, Taq DNA 
polymerase will deave tJic probe be- 
cause of its inherent 3' nudcfolytic 
acUvily. If the deavagc occun between 
UlC fluorescein and rhodaminc dyes, it 
causes an increase in fluote^win fluores- 
cencc intensity because the fluorescein 
is im longer quenched. Tlie Incr^ajie in 
fluorescein fluorescence Intensity indi- 
ealcx ttiut the probe-ipcciflc I*CR product 
haji birufi ^iffHTiiicd* Thus, PET bctweftn a 
it!|njitri dye and a qucnchtfT dye Is tTltl- 
cal to llic performance of llir j^iobe lu 
the S' xiu<.ltr<i^e PCH a»axy. 

Quenching is i*ofnp]cicl)^ dependent 
on the physical proiutinity of \Hv two 
dycs.^'VBcx«usc of thb, ll luis Innm as- 
sumed that the qucndicr dye mu>l be 
attached neai Uie S* end. Suq»naiii}*ly, 
we luve found thai attaching a rho- 
doHiine dye at tltc 3' end of a puitit: 



PCR assay. lliKhe.rmorc, cleavage o( this 
type of pn)he i.s not fcquirea to acnievo. 
some rctlucUon In quencl^in^^.Olijjonn- 
clcotidcs with a reporter dye on the y 
end and a quencher dye on the 3' end 
(fxhibft o much liigher reporter flu Of OS* 
cviicc when douDie-stranoed as com- 
pared with single-strondecL Tliis should 
make it pa-ssiblc to use this type of. dou- 
ble* lal>e1cd probe for homogeneous de- 
tection of nucleic acid hyimdizatton. 



MATCRIAU AND METHODS 
ongomideotldes 

Table 1 shows the nucleotide sequence 
of the oligonucleotides used In this 
study. Linker ann nucleotide (LAN) 
phoiphoramidlie was obtained from 
Glen Research. 'Ilie standard DNA plios- 
phoramiditcs, 6-carboxyfluorwcein (6. 
FAM) phosphoraiuidtte, <i<4irboxytet- 
pamethyirhodanilne sucdnlmldyl esiCf 
(TAMRA NHS ester), and Phosplwlink 
for attaching a :V-WocXliig phosphate, 
w«re oDtained trom Pc.rWn-Elmer, Ap- 
plied Blosystems Division, t)iigonucle- 
Otide synthesis was performed using an 
ABI model ^94 DNA synthesiser (Applied 
Biosystems). Primer and complement 
oUi^onucleotides were puriUca using 
Ollgu rui-incatiori Cartrid{;cs (Applied 
Blosysiems). Uwublc-lulM:k:d ;irf»bes were 
sytithoir^rd with <^MM*labeled pho5i> 
pliuidicifdilejtt tli« 5' itfid* I AH replacing 
Of Mi of tiiv Vs in the sequence, and fhos- 
phaltnk dt thc 3* cnd. Following de- 
|Mott«:tlon- and ctliaftui precipitation^ 
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FIGURt 1 Diagram of 5' nudeaje assay. Stepwise repres<;m«tlon of me 5' 3' nuclw>|yrtv ac- 
tivity of Tgq UNA pbiyznerasc aainK on a ftupiUKcnic ptvin: duriiiR u»c cxt(:n»1ui) p)ia$c <if l'(«K. 



mM NA*bicari:>onftic buffer (pi 1 9.0) m 
room tcmpcratuct. Unrcactcd dye vra^ 

iciuuvcU by p«UMi|(e uvcf « I'D^IO Scphti* 

dcx column. Plnally, the doubk-labclcd 
probe waa purified by prr.pttTnUvc Kish- 
performance liquid chroniato($r«ip}iy 
(Um:) ujiing an Aquaporc 224)x4.6- 
mm column with 7*p,m particle sisc« The 
column w» d«v€Jop<:d with a 24«niJu 
lincttT gradient of ^20% ttcctonUrllu in 
M TEAA (tricihy]oinlnc «c<-totc), 
JTobes are named by designating the se- 
quence Xcom liable 1 and the position of 
the lAN-TAMRA moiety, Vot example, 
probe Al-7 sequence Al with lAU- 
TAMRAat nucleotide position 7 from the 
•S* end. 



PCR Sy>lcim 

All PCK amplifications wrc performed 
in the Pexkin-Elmer GcncAmp PCR Sya* 
icm 96UU using SO-iU rcacUons that con- 
tained 10 mM Ttls-Ha (pH 8.3), 50 inM 
kCl 200 |iM dA'ir, 200 |iM dCll', 2U0 |u<f 
d<:m>, 400 pM dUTP/6»S unit of AinpEr* 
ase uracil N-glycosyUse (Pcdcln^Eimerk 



gene (nucleotides 2141-2435 in the se* 
qucncc of Nakdilino-]l|lnia ct al.)^'^ was 
auiplifltrd usliig piiiiiefs APl^ (Uid AK? 

(Table 1), which are modified allghily 
from tho»c of du Brcull cl al,^' AcUn am- 
plification reactions conta<ned 4 niM 
MgCJ;^, 20 ng of human genomic 15NA/ 
SO nM Al or A3 probe^ and 300 nM each 



TABLE 1 Sequences of OUgonucIcotidcs 



ptlnicr. The thermal regimen was SO*C 
(2 min), (10 min), 'lO ryclcK of 9.W 
(20 aec)> 60*C (1 niin), and hold oi 72*C. 
A 515-bp segniifni was afnpllfud froiii a 
plasmid iltat conaUiK ol a segment ol X 
DbM (nudcotldca 32,^?X>-32,747) in- 
serted in the Snwi situ of voctor pUCl 19. 
lliesc reactions wnialniHl ^.S him 
M);CU;2« ^ ofi of plasrnid DNA« 50 nM ?2 or 
PS |v6bc^ 200 nM primer V\19, and 200 
UM piiincr Ri19. The thermal regimen 
Waa 5Q"C (2 roln), 95*C (10 min), 25 ^y- 
CiC« Of iZO acc), src (J niln), onti 
holdat72t:. 



l<'or each amplification reaction, a 40^fU 
aliquot ol a sample was transferred to an 
individual well of a white,.9^w<ill n')icru< 
titer plate (P«Tldn-Ulaier). Fluorescence 
waa measured on the Pcrkin-Elmef Taq* 
Man LiS-SOU System, which consists of a 
lutnlncsconco apodromcitcr with plate' 
reader assembly, a 465-nTn excitation ill* . 
ler, and a ^I.S^nm emisftion filler, rxdla. 
tion was at 488 nin using a slit 
widih. Emission was measure at S18 
nm for 6-1'AM (the. rQp^»r(cr or U value) 
and .^ft2 nm for TAWllA (tbc quencher or 
Q value) using a lO-nm sUt -width. TO 
determine the hicicasc in te|fuit«i i.-mlS' 
ifiim that h «:uu$cd by dcavagc of the 
probe during I'CR, three normalirjitlona 
aie applied to Uie raw cmLVkion data, 
first, emission Intensity of a bulfei blank 
is subtracted for each vyaveleiigth. Sec- 
ond^ emission Intenfiity of the reporter Is 



Name 


lypc 


F119 


primer 


HI19 


primer 


pa 


.l>robe 


P2c: 


. complement 


PS 


probe 


r5c 


compUtueiu 


AW* 


primer 


ARP 


primer 


Al 


probe 


Ai^. 


completnenl 


AJ 


ptoi>e 




(Amipleuteul 



.SefjueitUf 



ACCCACAGOAACrCATCACCACrC 

AJxnxxicxriTCCOGCir.A<xrncntiC 
i<XK:A'iti>\crOAixx-nx;cc:AAccAC'Tp 

CnAClXiCn'(XX;CAA<X;ATCAOTAATGCCA'l'C 

(^uOAirrGcnXjcrrATcrATCACAACCATp 

n^CATCCITCrrCATAGATACC:AOCAAA'^^ 

TCACXXIACACrGTGCCCATCTACOA 

CV\<iCCKiAAtx:CiCrc'AIT(.iCCVV\rOU 

Ac:Atxu»ut;Aix:<:(jk'ix;c;cxx:A<;(;(»:Aiv\C 

CXKXXr!tS0AClTCCA0CAA<SAOAT|. 

CrATTntnTOCTCm\C5TCCAGOGCC5AC 



For each oUgonucicutidc used in iliis Muayi the nucleic add scqueoa* is givcin writtui in the 
&' » 3' dinxiioii. Tbeie are Ihrer types of al%uniidcoiideai PCR pf Imeri fluorogenic probe used 
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fiCURt 2 Rvsuiu o( S* tiuvlr«oc fi*»«y 4 4«i.(>aru\g ^A^Un jirf<^bcs with TAMRA at dif^ermt mide 
otkU iH>5itions, As doscrHwid In Materials and Methods. W3t NmpUAcatlnnj^r containing the In^ 
dicdied piobes wtre performed, and the fluiirtiuwcncc emission was m««sur«d ftt 518 «nd 562 nm. 
ftepottfU vflUu-* arc the average:*' 1 |i,o» for six reactions nm without added template (no temp.) 
and Six read Ions run wtth template ( i tcniP')- 'Hh' HQ ratio was ccilculated for «ach indtindus} 
reaalon and averaged to give tlie leporteSTRCj' fln^ MQ* values. 



OlviUcU by the emlsislon intensity of the 
quencher to give an RQ ratio for cadi 
reaction tube. This normalizes for well- 
to-tven variations in probe mnr«ncra- 
tion and nuorescence measurement. 
' nally, IS caictilaied hy subtracting 
tiic KQ value oi the no-ttsmpliittf cxmtro) 
(RQ"~) from the KQ valuts (tir Uitf vum- 
plctc reaction including template 
(RQ'). 

RESUtTS 

A sencs of probes witti increasing dis- 
laaces between the nuorf!»ct.*in rvpurtci 
and rtiodamlne <pencher were tcvcetJ Xu 
invcsngatetheTninimum and maxininTn 
spacing that would give an acceprahle 
performance in the 5' nuclease rCR as* 
wy. Tncse prubes nybdduc xo a target 



Aequencc in the human p-actin gene. 
MguiC 2 shows the results of «f) cxpcri- 
nicni ill ivliich these probes were tn* 
dudcU in pen thai aoiipUfled a segment 
of the gitiitt (xinLaining the taigct 

sequence. IV^foMfiaiice Ui Ihe S' nu- 
clease I*CR assay Is ntoriltAueU liy the 
in<ignhud« of AKQ^ which b'A measure 
of the Increase in repurier ntiurot^nw 
i^udcd by PCR ampll^catiQn of the 
probe target, PrubeAl-Zlwivii 4KCi value 
that 1$ close to rmt, Uidlcatit^g that the 
]3robe was not cleaved appreciably dur- 
ing the anipllf leal toil rcitUum, TJiii* sug- 
Kc:>l» that with ihc ^jucn^hcr dye on die 
se^xjild ttucleoUdc ^rnm th<ii»' end, theft 
Is lusufricieiit icKini fni Tutf polymerase 
to cleave efficiently between tlw reporter 
aiiU ^uendiei. The other Hvc probes ex- 
hibited comparable: AKQ values Chat afe 



clc4tr)y different from zero- 'ilius, all five 
prolxrjt arr befng cleaved dwring 1*C:K am- 
plJf)C4tiun ie>uHff}g iti a shidiar |iicr<:d^tt 
in leportcr lluuiescejiw. U slujuld l>e 
noted that complete digcj?tibn of a probe 
prodtices a much larger inCTcasc in re- 
porter fluorescence Uian that observed 
in Plgurc 2 <dau not shown). Hius, even 
in nMCtions where amplification ocx.*urs, 
the ma}onty of probt; moJccules rviiialii 
uiidcaved. K is mainly for this reason 
that the lluoresceace intensity of the 
quencher dye TAMJU changci Hltlc wllh 
ampHflcatlon of ihc targfu. I'hls i.^ what 

allows us to use the 662-nni fluorcuccncx 
reading as a normallMtlon factor. 

The niagnfhid#» nf Rf^* depends 
mainly on the quenching efficiency in- 
herent in the specific stnicture ol the 
probe arid the purity of the oligonucle- 
otide. Thm, the larger KQ' values indi- 
cate that probes Alrl4, AJ-19, AV22, and 
Al-26 probably have teduced quenching 
as compared with Al-7. Still, the degree . 
of qucnclUng Is sufficient to detect u 
highly slgnifleant ImTcuiK^ In reporter 
fluorescence when c4eli uf these ^>rol>e5< 
is cleaved during PCR. 

To further investigate the ability of 
1AMKA on the 3' end to quench (J FAM 
on the 5' end, three addiUonal pairs of 
probc5 were tested in the 5' nueicose 
PCR unsay. Fot each pair, one probe has 
TAMRA attached to an intcmol nude- 
iilUk and U'te othej has TAMRA attached 
to the 3' end nucleotide. The fcsulls arc 
shown In labic Vor ait three sets, the 
ptobe with the 3' quencher cxhit>Us u 
dRQ value thai is kvnsiUerably lil^Utu 
ilian for llie probe with the Internal 
quencher, The RQ' values suggest that 
difCecences in quenching arc not as j^rcuk 
as those observed with some of the Al 
prolyes* These results demonstrate that Q 
quencher dye on the 3' <?nd of an oligo- 
nucleotide can quench efftciendy the 



TABLE 2 Results of b* Nuclease Assay Comparing I'robns wiilji TAMBA Attached to ah Internal or 3'<termina1 Nucleotide 



Slgnni j>ll2nm 



Trobe 


no temp. 


+ temp- 


lU) iciup. 


+ icmp. 








A3*6 
A3-24 


54.6 2. 3,2 
72,1 ^ 2.9 


84.8 X .1.7 
236,5 A n.I 


116,2:2.6.4 
84.2 * 4.0 


n!i.6ia;5 

90.2 'J- 3,8 


0.47 is^dxa, 

0.86 :l 0.02 


a73a.O.O» 
2.62 ±0.05 


u.2&±arM 

1.76 i 0,05 


17.7 
1^27 


82.8 2. 4.4 
113,4 = 6.$ 


3B4<0±34.1 
555.4 db 14.1 


iuy)x6.4 
140,7^8.3 


120.4 =r laz 
118.7^4,8 


0.79M0.02 
O.S1:tO.Ot 


X%9 * ai6 
4.68^0.10 


X40:« O.10 

3.88 T 0.10 


I'S^IO 
r5'28 


77*5 ±6.5 


244.4 a 15,^ 
333.6 St 12.1 


86,7 li 4;3 
m6±6J 


9S,a 6.7 
94,7 X 6.3 


0.89 A 0X15 


2.55 A 
U3:bai2 


1.66 A a08 
2.8910.13 



... t^u-^^ mttMk* mnA niiniUiions wcstT oeiiormed as described In Mutedal m<«<1 Methods and in the legend to Klg. 1 



tTOia aora 09I 6^6 XVd IS:n 2002/S0/2T 



From : Br^ft. 



PHCNE No. : 310 472 0905 



Dec. 0S 2002 12:17«M P06 




flu<K«tr^nr<* nf a «*prtrter dye on the 
ontl- T\w il<j(<ruc of qucncWng h 5uff»* 
cjcal for Oiiit type of ullgonuckotidc to 
be used as a probd in the A' nuclease PCU 
assay. 

To test the hypothCfiifi that qucnchini; 
by a V TAMRA depends on ihc flexibility 
oi 4]« <sli|$onuclcotido, fluorescence was 
meaiuiKKl fur firubus . in the Single- 
smftded and double stranditd st«itc$« Tft« 
hf<^ 3 rcporU fluures^ccnvc ob!(crv(:<l 
at S18 and 582 ttm. The relative degree 
of quenching is assessed by calculallug 
the RQ ratio. Ikir prot>CS with lAMRA 
MO nucleotides ffom the S* end, llierc 
!s little difference in the RQ values when 
arnip^tring^ slngle^slrandcd with double- 
straadcd uligonucleotldes. The results 
for probes with TAMibl at the 3' end are 
much different For these probes^ hy- 
brldiiatfoti to a complementary strand 
causes a dramatic Increase in mq. We 
propose that this loss of quenching is 
caused by the rigid structure of double- 
Stranded l>NA, Which prevents the 5' 
and 3' ends from bclnjj in proximity. 

When TAMRA is placed toward the 3' 
end, there is a marked U^* effect on 
quenching. Figure 3 shows a plot of ob- 
served HQ values fur the Al series of 
probes as a function of Mg^"* coricenira- ' 
tion. With TAMHA attached near the S' 
end (prohe A 1 -2 or Al-7), the KQ vahie at 
0 niu Mg^" is ojily. slightly higher than 
HQ at 10 ittM Mg^*. probes A1.19, 
Al -22, and Al»26i the RQ values at 0 miu 
Mg*^ are very high* Indicating a much 



reduced quenching efficiency. Par each 
I if these probes^ thftie h a marked do- 
Cfctisc in UQ at 1 ynw Mg' * folUiwcd by 
u {gradual decline as the Mg* * iwuccu- 
trotion increases to 10 mM. I'jube A1-S4 
shows Alt intcrmediAte RQ value at O mM 
M$'^ with II smdual dcdtne at higncr 
^K^'^ coai:eniitt(i4nis. In a low-sall en- 
vironment with no Mg' present, a sin- 
gle- jkimndcd oll^onucliTOilde would be 
expected to adopt Att extended UMifor- 
mat ion because of electrostatic repul- 
sion. The binding of Mg** Ions aas to 
shield the negative diarge o£ the pho.s- 
phate bflcId>onc so that the oUgojiucle- 
otlde can adopt conformations where 
thct end is close to the 5' end. Ilieie- 
fore, the observed Mg' ' effects suppnn 
the notion that quenching ol a 5' le* 
poner dye by TAMHA at or near the 3' 
end depends on the flexibility of the of I- 
gonucieoride. 

DISCUSSION 

The striking finding of this study is that 
il 6eems die rhodamine dye TAMKA, 
placed at any position iu an oU|;onUClC- 
ntidc. can quench the fluorescent emis- 
sion of a nuOK*scein (0-FAM) placed at 
the S' end. This implies that a single- 
stranded, double-labeled oligonucle« 
otide must be able to adopt conforma- 
tions where the TAWRA la dose to rhe 5' 
end. It should Ije noted that tlic decay of 
^•I'AM In the excited state requires a cer- 
tain nmount of time. Therefore, whai 



TA&IC 5 Comparison Plunre,\ccn<c }>iui>^oxis of iiin2;l^;iTrandCd and 
I>ouUl«-jiir«nHcd Fluorogenlc ^fdlle^ 
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509.3a 
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0.tll 
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39J3 


165^7 


0.43 


0.3» 


A.V24 


3Q.05 
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07.77. 


140.25 


0.45 


3.21 
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70.1,1 


.Vt.63 


121,09 


OM 


0.58 






>20.47 




61,13 


0.61 


SJi5 


l*S-10 




144.a5 


61,?5 


165.54 


i}M 


0.87 




33.65 


4^.29 


7A.a9 


1<H.«1 


0.45 





(») Slnglc-stranded* llie ftuorescence emissions at 518 or 582 lun for solutums rontalnlng a final 
concenmnlon of 50 nM indicated probe, lo mM TtI^i id (pit lu^l SO oim KG« and 10 mM U%C\^ 
(ds) Dmihte-imndpd. Th« solutions contained* in addition, 100 iiM AlC ibr pnifars Al-7 and 
A)*Z6. 100 mi A3C for prc>bes A3-6 and A3-24, 100 nM l*2<: k« \m'Am rZrJ and J'2*77, or 100 ma 
r$C fw psolMsr P3.t0 And r5-2M. Hcfnrc mc atfdtifon of Mxi^'^iyi 120 lU Of eaclj >am|}l« was Itcatcil 



mattcm for qucocliliig i^ not the averagt 
distance between and TAMRA 

but, rathei; how close TAMKA can get io 
6^MM during die lifenme Of the 6-FAM 
excited state. As long «u> ihc dcc<sy time of 
the excited state. K« relatively long coin- 
pared with Uie molecular motions of the 
oligonucleotide, quenching can occur. 
Thus, we propose that TAMFA at the 3' 
end, or any other position, can quench 
6-FAM at the 5' end because TAMKA Is in 
proximity U) 6*KAM often enough lo be 
able to accept energy transfer from an 
excited 6*FAM. 

Deuils of the fluorescence me^surc^ 
mcnts remain puzzling. Tai examplc/rji> 
bk 3 shows that hybridiattitiun of probes 
Al-26, A3-24, and P$-28 to their comple- 
mentary strands nor only causes a large 
increuise in 6»FAM fluorescence at SlU 
rnn but also causes a modest Incrciisc in 
'I'AMRA fluoresccna* at S82 nm. U 
'l AMRA is being exdred by energy trans- 
fer from quenched d'YAlA, then loss of 
quenching attributable to hybridization 
should c3use a dccre8.se In the fluores- 
cence emission of TAMRA, 'ITic fan that 
the fluorescence cmi.'ssion of TAKfRA In- 
creases indicates that rhe. situation is 
more complex. For example, we have an* 
iKximal evidence thar the bases of The 
oligonucleotide, especially ii, quench 
the fluorescence of both d-FAM and 
TAMRA to some degree. When double' 
stranded, base-pairing may rc^duce the 
ability <jf the bases to quench. Ihe pd- 
inaiy factor causing tile quenching of 
6-FAM in an intact probe h the TAMBA 
.dy€. Kvfdcnce for the importan<:e of 
TAMUA is that 0 FAM fluuuiscence 
remains relaiively unchanged when 
prober IuIksKxI uuiy with 6«FAM are used 
in the $' nuclease I'CR assay (data not 
shown). .Sect>ndary electors of fluores' 
cc-nce, both before and afiei cleavage of 
the probe, need to be explored furilicr. 

J^egardlcss of the physical mccha- 
nbrm, the relative independence of posi- 
tion and quenching greatly simplifies 
the design of probes for the 5' nuclease 
rCR assay. There are three main factors 
Uiat determine, the performance of a 
doublt'labvled fluorescent probe In tlic 

nuclease IKiJR oway. The fic^t factor Is 
the degree of quenching olwerved in the 
imaa probe. Thb Is characteriswd by the 
value of RQ* , which Is thc ratio of re- 
porter ta quencher fluorescent cmis 
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FIGURE 3 VM^i d»f Ms'^'* t^i^centrAtiAn an RQiatlo for the Al aeries of prob«s. Ilie fluorenceiu*? 
emission InUnsUy ai 518 and 582 nm was measured far Milutiimx containing 50 nM pfolx^j 10 mM 
Tris-HO (pH 63), 50 mM KCJ. »nd varying aniounu (CIO mu) of MgOa- ITxr o*lailatc<l 
ratios (^<il6 nm Intensliy divided hy 5H2 nm ititi«ii2sfiy) art- iiluttvd Vil. MgCU concentration (mM 

Mk)< I^u; k«y {i^fjmf t^g^ ^Uit\y> lUv fiiutjish vaiuuimuiL 



dycA tised, .Spacing between reporter Arid 
quencher dyes, nucleotide sequence 
context cffecUj pccacncc of ^Irudurc or 
ulhet fttClurS Uiui reduce n«xlbilUy uf 
the oUgonocieotlde, and pudly of (he 
probe. The >cwnd factor ix th<^ cffidc.ncty 
xtt hybcidu&aUoa, which depends Oil 
probe r^,, presence of 5Ccondary MruC- 
lurc In probe or tcmplAic, annealing 
. tempctalurc, and other reaction condi- 
tions. The third fdctor is the efficiency at 
' which Tag DUA, polyineraH* delves the 
, bound probe between the cepdrtcr and 
quenclitfr dyes. This cleavage is tlepcn- 
Jeul on H-t^ucncc complementarity be- 
tween piobe and template a5 shown by 
Uie ob^ivaUou that mismatches In the 
segment between xeponer and quencher 
dyes drasticiilly n:duix* the vlcuvai^v: <if 
probe." ^ 

Tht? rise in RQ* vaUics for the Al $e* 
ncs of probes seems to indicate tliat tiie 
degree of quenching tn rcdticcu some- 
what as the quencher is piaced toward 
die 3' end 'Ihe lowest apparent quench* 
ing Is objerved for prob€ Al-39 (see Fig. 
3) ralliet than for the probe where the 
TAMRA Is at the 3* end iAi^Z6). rhts w 
•understandable, as the conformation of 
the 3' end position w<mld be expected to 
be less restricted than the conformaticm 
of an Internal ptnti^on. In eff«t:t, a 
qufoichcr at the 3' end is freer to adopt 
cuxifonnatlons close to tlic S' reporter 
dye than . Is an internally plaoid 

9T0ia 



prol>es« tite interpretation of HCi values 
is Ies» clear-cut. Ilic A3 probes show the 
same trend as Al, with the 3' TAMRA 
piobc having « lai^^er RQ" Uian th« hk* 
ternal TAMRA probe. Tor the V2 pali/ 
lx>tli probr,,^ havr ahoul the same RQ" 

value. Foi the PS probes, lite RQ for ilic 
3' probt U than f^f the lotci-oaUy 
labeled probe. Another factor that may 

explain ipjnc of the oKwTVctd variaHon is 
that purity ftffcctj the RQ" value. Al- 
Ihutl^h 4li pmbcs «re Hi^lXl putificd, & 
snr^ali omounf of conlramlnation with 
unqticnchcd reporter tain have a large ef- 
fect on HQ . 

Altiiuu(;h then: ni&y be n mod^t ef- 
f«*i cm dearcc of quenchll\g, the posi- 
tion of tlic quencher apparcfitly uin 
lidvc a large effect on the cffidcnry of 
pi'obc cleavage. The moxt drastic effect is 
observed with pnibe Al-2, where place* 
ment of the 'iV\MRA on the second nu« 
cieoliUt: t«dm^ the efflcleney of cleav- 
age to almost rcro. For the AS, 1*2, and PS 
probes, ARQ Is much greiiiw for the 3' 
TAMRA prohtts as compared with the in- 
ternal TAMHA prober Tills is explained 
most easily by assuming tlml piobe^ 
wUli TAMRA at the 3' end ore more likely 
10 l>e cleaved Iwtwmi icpoiie^ and 
quencher than arc probes with TAMRA 
attached ititeniaUy, For the Al probes, 
the dcavage efficiency of probe Al-7 
must already he quite high, as ARQ docs 
not Increase when the qucncner is 
ff>l»r«tfi rio«cr fn thp ^' end- This illus- 



trates the importanr<» nf hoing ablr to 
use probes with » quencher on Uia V 
end in the S' nuclease I*cu afisay. In this 
assayi an increase In the intentiity of re- 
porter fluorescence . is observed only 
when the probe is cleaved between the 
reporter and quencher dyes. By placing 
Uiv lupculur anil qutfiieUui dyu& uii Uto 
opposite endfc of iin ollgonucleoiWe 
proljc, any cleavage that occurs will be 
detected. Wlicn tlic qucnclicr is uitaeliecl 
to uii Uuuriiul nudvotlUu, Boinutlinc's tliu 
pfube work^ well (Al*7) a/id oihur tltiXes 

not so w«ii (A3-6). Tim relatively poor 
performance of probe A3<6 presumably 
means the probe is being cleaved 3' to 
the quencher rather than between tlic. 
rt^rtpr and quencher. Thej'efore, the 
lM!st Chance of having; a probe that reli- 
ably detects <iccuniulatioii of PCH prod- 
uct in the 5' nuclease 1*C:k assay is to use 
a prohc wirb the reporter and quencher 
tlyes un uppositu cind^i. 

Plocing the qu<:Acher dye on the 3* 
end may also provide a sligiit bonefli In 
terms of hyhfidlwition efficiency. 'Ilie 
presence of 0 quencher attached to an 
Internal nucleotide mlf^lif l>c expected hi 
disrupt lM3e;^t»rin^ and' redttce the T„, 
of a pfol>e. in lud, a 2f*i)-TCl r\%i\tvA\i.»n 
in Tft, has been obscivcd for two piobcs 
Willi ijUtni'ially ttlUdicdTAMUA^.'"^ IIHs 
disruptive effect would be minimised by 
placing the quencher al the 3* end. Thus, 
probes with 3' quenchers might exhibit 
»Hfthtly hifthe-r hybridization cfficicncjcs 
(httil plubeS Willi tnteciioJ queiiclien. 

The combination of increased cleav. 
age and hybridisation efficiencies niean.s 
that probes with 3* quenchers probably 
will be more tolerant of mismatches t>c- 
twccn probe and target as compared 
wtlli jnlernally labeled probes. Tins tol- 
erance of mismatches can be advflnta- 
geous. as when trying to use c kinglc 
probe to detect PCK-amplificd producte 
fnitn .v»ifp1e> uf diffeienl species. Also, it 
mean's that cleavage of probe during PGR 
is less sensitive tu alicrdJiionik in ad« 
nealin^ temperahire or other reaclion 
conditions. The one. application where 
tolerance of mismatches may be a disad- 
vantage is for allelie discrln^lnotlon. l^c 
ct al/^* demonsuated that allele-spedflc 
probes were Cleaved between reporter 
and quencher only when hybridized to a 
perfectly complementary target. This al- 
lowed them to distinguish the normal 
hnnian cystic fUna'sis allele from the 
AFSds mutant. Thtnr probes had TAMRA 
attached to the seiventh nucleotide from 
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FiouRE 3 effect of Mg^ * concenmrtiun on ftQ raUo (or tuc A] series of probes, ilic AuumKMicii 
emissiun intensity irt 51ft and 582 nm was measured for solutions contatnlng 50 om probe. 10 mM 
TfiSplia.(pH 8J)« Siy XTku KCi, and varying amounts (0 10 xum) of Mga,. Ilir calculated RQ 
latkis (518 nm Intensity divided 5H2 mn Intensity) arc plotted vs. MgQi conccnmiti"« (f«M 
MiO- The I;^ {upper Hfi^t) shows the probes examined. 



dyc5 \i5cd^ spacing between reporter and 
quencher dyts, nudeoUdc sequent^t 
context effects, presence ol struclure or 
other factors that reduce flexibility of 
the oUgonucieotide, and purity of the. 
probe. The second tactor is ine efficiency 
Of hybciai7.ation, which dcipcnds on 
probe T^, presence of secondary struc- 
ture Iri probe or template^ annoaling 
tcmperatur«, and other reaction condi- 
tions. The third faaor Is the efficiency at 
which Toq UNA poiymei^ cleaves the 
bound probe between Ihc reporter and 
quencher dyes. This clcaviage is depen- 
dent on sequence complementarity be- 
tween pral)6 and templattt as shmvn by 
the otomatlon that mismatches in the 
segment bet%wcn reporter and quencher 
dyes drasdcally reduce tlie cleavage of 
prohe.<*> 

The rise in KQ values for the Al 80- 
ries of probes seems to indicate that the 
degree of quenching is reduced some- 
what as the quencher is placed toward 
the 3' end. The lowest apparent quench- 
ing is observed for probe A1-19 (sec Fig. 
3) rather than for the probe where the 
TAMBA is at the 3* end (Al«26). 1t)is is 
•understandable* as the conformation of 
the y end position would be expected to 
be less restricted tixan the conformation 
of an lt)tcmal position. In effect, a 
quencher ar the 3' end is freer to adopt 
conformations close lo the 5' reporter 
dye than, is an intemally placed 
quencher; For the other three sets of 



probes, (he iiiturpretatkfa of RQ' values 
is less clear-cut. Tlie A3 pr«b«s show the 
same trend as Al, wlU) the 3' TAMRA 
probe havinjj a larger RQ"' fhun (lie* in- 
icrnal TAMRA probe. For Uie 91 pair, 
buth probes have about the same RQ 
value. For the probes, the RQ' for the 
3* probe is le:»s tJwn fw tlic Intcii^ally 
labeled probe. Another factor that may 
explain some of the observed variation Is 
that purity affects the RQ" value. Al- 
though all probes are HPLC purified, a 
small amount of cnntaimlnatlon with 
unquenched rcfxirter can have a targe ef- 
fect on RQ . 

Although there may bc ft modest ef- 
fccl on degree of quenching, the posi- 
tion of the quencher apparently can 
have a targe effect od the efficiency of 
prQbe deavagc. The most drastic effect is 
observed With probe Al*2, where place- 
ment of the TAMRA on the .wond nu* 
clex>t»de reduces tlie efficiency of cleav- 
age to alrnosi ^cro. For the A3, P2, and PS 
probes, ARQ is much greater for the 3' 
TAMitA proheji as compared with the in- 
temal TAMRA probes. This Is explained 
most easily by assuming, that probes 
with TAMRA at the 3' end are mote lllccly 
to be cleaved between reporter and 
quencher than are probes with TAMRA 
attached Internally, for the Al prolKS, 
the cleavage efficiency of protic Al*7 
must already he quite high, as ARQ does 
not increase when Hic quencher Is 
placed doser to the 3' end. Hits illav 



tiatos the importance of l>ciitg a))lc in 
use probes with a quencher on the 3' 
.end In the V nuclease t'CK aasoy. In this 
assay, an tnereaite In th<» intensity of t^. 
parter fluorescence 1? ot>it«rv©d <jnly 
when the probe is cleaved t>ctwccn the 
reporter «nd quencher dycj. hy placing 
th<* reporter and quencher dye« on tha 
oppocUc ends of an oligoiiiidcotidc 
j/nil>*f, aijy rleavii}^<^ \\va\ ovi-tifi will In*, 
detected. When the quencher \k att&ched 
to ;if) Uuc-iaal nuclc'Otlilei frvinoilnios tlic 
prohe.work.^ well and oihe.r times 

not 4K» well (A3^). The rclotlvcly iKJor 
performance of prnl>e presumably 
means the probe is beltiK cioavcd 3' to 
the q\icnchcr mthcr thnn between the 
reporter and quencher, 'I'h^«{ofe, liie 
t>e^ chance of having a probc that reli- 
ably detects dccuuiulHtiun of IK.^R prod- 
uct in the 5' nucteaiu^ PCR assay Is to use 
a probe with the reporter and quencher 
dyes on opposite ends. 

Placing the quencher dye on the 3' 
end may also provide a slight benefit in 
terms o* hybridization efficiency. 'Ilic 
presence of a qtiencher attached to an 
hitemal nuvlcuUde inl|»hlbe expecicd to 
disrupt base-pairing and reduce the r,^ 
of a probc. In fact a 2*C-3''C reduction 
ill has been ObSCfVCd for two probes 
with internally atUched TAMRAs.^'' I his 
disruptive effect would be mlnlmlced by 
pladng the quencher at the V end. Thus, 
probes with 3' quenchers might cxhibli 
slightly higher hybridization cffidc.nc.l«s 
than probes with internal quenchers. 

1*hc combination of inaeased deav* 
age and hybridization cffldcncles means 
that probes with 3' quenchers probably 
will be more tolerant of mismatches be- 
tween protic and rarget as compared 
with internally labeled probes. This tol* 
erancc of mismatches can be advanta- 
geous, as when trying to use a single 
pfobc to detect PCR-amplified products 
from samples of different species. Also, It 
raean3 that cleavage of probe during PGR 
is less sensitive to alteratloiu In an- 
nealing temperature or other reaction 
conditions- Tlic one application where 
tolerance of mismatches may be a dlsad- 
vantacc is for allelic discrimination. Ix^e 
et al.^*^ dcmonsuatcd lhat allclc-speclfic 
probes were cleaved between reporter 
and quencher only wlien hybridized to a 
pi^»fcctly complcrnentary target. This al- 
lowed them TO distinguish the normal 
human cystic fibrosis allele from the 
^OiB mutant. Their probes had TAMRA 
attached to the seventh nudeutldc from 
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ihtA y end and wojo <l<«i;l9ncdl so thai any 
mUtimtchcs %vcrc between the reporter 
AAcI tjuenchcr. Increculng the distance 
betwoon ra|KYrt«r and €|u«nehAr wo\il<i 
le&scifi 11)4! clls^pUvc effect of mb- 
matches and allow cleavage of the probe 
on the incoEzoct targut. Thuf, probes 
with ^ quencher Attached to an internal 
nudeoHde may stlU be ucofal for allelic 
dl»vliDloatlon. 

In ihU study lo4t of quonchlng upon 
hybrtdUatlon vnwi used to show tliat 
quenching by a 3* TAMUA in dof>cndont 
Oft tin: HcxlbUlty of a slngle^xo'andecl 0)1- 
g6nudt!otid«, The Increase in reporter 
fluomcericc iiitcnyity, though, could 
alto b« used to ddtcrminc whether hy- 
brldlzaUon has ocoirrcid or nor. Thus, 
oUgoouclcoUdcs with reporter and 
queftchef dyes Attached at oppostic end* 
should also be u jeful as hybridization 
probes. The ability to delect hybrldl^i^- 
Tlon In real time means that these probes 
could ha usixl ro measure hybridization 
kinetla. Also, this type of prObe could be 
used to develop homogfincoiis hybrid- 
iration auayi lor dlAgnostlc& or other ap- 
plications. Ba^cll Ct al/^^^ describe just 
tins type of homogcneoiis assay where 
hybridization of o probe caust^ uii In. 
crease In fluorescence c&uacd by a losa of 
quenching. However, they utilized a 
complex probe dcsi^ that requhxts «0d- 
in^ nuclcotl^les to both en<b of the 
probe scqu^foci! to form two imperfect 
Haiipuis. The results presented here 
dciiioisstiate cluit the simple addition of 
a rcporuir dye tt> one end of an oUgonu* 
cicotlde and a qucnclier dye to Uic oUtui 
^ni gcnerat&a a fluorog^lc probe that 
can detect ls)«bridlzation or VCiK atnpUil- 
CHtion. 
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Real Time Quantitative PGR 

Chrlsti<in A. Heid>' Junko Stevens,^ Kenneth |. Livak,^ and 

>; Mickey Williams^ 

^BicAnalytical technology Oepmlnient, Ccnentcch, Inc. South ^^"l^^^^^^^^ 

4ppll€Cl nioSysiems Diviiiun of Perkin nm«f Corp,, Fo^tftr City, C^Wornto 94404 



Wc have developed a novel 'Teal time- quamUatlvc PCR ..cUiod. The "^f^^f^. "^^^^^^^^ 

acaimul.non ihit>ugh . duaWabd^d nuoi«entc probe Lc. TaqM^n ^J; JJ« ^^^^^ rSl Un^e iS 

acairace and reprodudfale quanllutfon of gene copies. Unlike oth^r quanlilallVft PCR ^^^^^^^^^^ 

dS Z ^u\^ POM^PCR sample handlliifi, prcvenUng potential PGR product carry-over <^ontaml»3iion ai^d 

Sulria, ITmuch^^er and higher rliroughput assays. The r.aMlme PGR "^<^h<>f'^^^ 

ranee of staning target molecule deiermlnatlon (at leau Ovc orders of magmtudc), ReaMlme auantltaflvc 

PGR \s txiremcly accurate and less labornntenrfve than airrenc quantirarive PGR methods. 



Quantitative niideic acid sequence aiikiysis Has 
had an important nile in many Oelds of hiologi- 
ail research. McasuKMTiciit of geiit expression 
(RNA) lias b««n uf^d cxTciisivcIy hi nxJuUoring 
biological responses lo various stimuli Clan ct al, 
1994; Huanft ci aL I995a,b; Prud'hommc el al. 
1995), Quaniiiatlvc gene analysis (r;NA) has 
iH-cn used io d«i^i*mtne the gcnunte quantliy ol" ? 
|.xirticular gene, as in the case ot the human HM2 
gene, which Is amplified in -SWi of brc,-ist Hi- 
mors (Slamofi tn al. 1987), CJcnc and genome 
quantitiition A and RNA) also have been tised 
for analysis of human iinjnuni>dc(iciency virus 
(HJV) burden dcnionstraUng changes in the lev- 
els of vlnw throughout the different phase-s of the 
disease (Connor t;t aK 1993; Platak et aK jvv;sh; 
Kurtado et al. 1995). 

Many methods have heen dcscrlhcd for the 
quantitative analysis ot nucleic acid sequences 
(both for RNA and DNA; Southern 19/6; Sharp ei 
al. 19K0; Thoujiis 19«()). Recently, PCK lias 
proven to be a powerful tool for quantitative 
nucleic acid analysis. PCJR and reverse transcrip- 
tase (KO-PCR have permitted liic analysis of 
minimal starting quantities of nucleic acid (a?; 
Utile as one cell equiA'alcnt), This has madf. |k>s- 
slble many experimenls that could not have been 
performed with tradidona! methods. Although 
PGR Iws provid<fd a powerful tool, it is imperative 



AuthAr. 



that H be uwl properly for quaniilutlon (U««y. 
maeker:* 1995). Many early r<rjK>rts of quantita- 
tive: PCK and R'l-PCR described quantitation of 
the rCR fjroduct but did not measure the Initial 
target s<xiucncc quantity. II is essentia] to design 
proiKi controls for the quantitation of the initial 
torget sequences (Pcrrc 1992; Clcmenll ct al, 

100?) " 

Ke.N<:Ufcheis have developed several methcjds 
of quantjt;itive I'CK and RT-PCR. One approach 
measures i'CK prfjduct quanthy in the Uig phase 
Of the reualon before the plateau (Kellogg el al. 
1990; l*ang et a), 1990). This mciho<J reqtiircs 
dial each sample has equal input amounts of 

. nucleic- add and that each soirijilc under analysis 
amplifie-s whli idvnl ic^l efficiency up to the fxiint 
of quuuiitarivc analyds. A gene sequence (rnn- 
tained hi wH samples at relatively constant quan- 
tit;^, such as p-«clln) can be used for sample 
ttnifliTication efficiency normali23tion. Using 
conventional methods of PC-R detection and 
quan tita lion (gel electrophoresis or plate capture 
hybHdl^ation), it is extriemely laborious to assure 
that all samples are analyzed during ttie log phase 
of the rciKi^on (for Ixilh the target gene and the 
nornializati<tn gene). Another method, qwantUa- 
tlve competitive {QC)-1^:R, has iKen developed 
and 1:5 tised widely for PC:R quantitation, (JC-PCR 
ntllcs on the inclusion of an Internal control 
competitor in each reaction (Becker-Andre 1991: 
Hatak cl ol. I993«,l>). Thv^ efflcioni:y of each re. 
action is nortualhscd to the internal compel iior. 
A kimwn iiifuuint of Interna J competitor ean be 

annrv men ooi WJ «c.:*t rnft7/cri/7T 
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acidod tn cmch ftumplc. 'Vo obloin relative n"«ni- 
tQtion, the unkiUiwn target PGR pKKlurt is ctnu- 
j)9rcd Willi ihc knov«n 'coinpvtiU»r iK'M fncuiuct. 
.Success uf a cjuantHaKvc cumpcrilivo I'CU assay 
relitK <>n acvdoplng an internal UMitn.)! Ilml am- 
|iiirii*:s wUh U\c same tiffideiicy «n iliv umh^i "^oi- 
cculc. TllC design of tUv cuzupctltuf «nU the vo}i- 
dation of arripJIflcailon cfficicjit^<r> icqulrc a 
<!i:dicated cffurt . Huwevci', l>ccnu5C QC:-IK:H d<K8 
notrccjidK* ilwt PC:K pxiKluct^ be anal/zed during; 
the loic* phaM.* of thi^ uiitplancalion; It !$ Itiv iiii.sWr 
vf th« two methods to visc« 

5evenil dettfctkin MyatiVKu uic ii>^nl for quan 
UUtivc 1*CK «iu1 R1-rc:tt analysis: <3) ai;i*rtx*!0 
gvU, (2) fluunri<.xiii labrHnK K^U pioducis ajid 
dclccilon with l«.'»4?r-liKliicrd flu<jrc.Hx-ncc uaUi^ 
capillary cli:<:trc>pli<)Tc;»ia ct al. 1995^ Wil- 

liams CT al. 1996) or acTylaiuidc gcKn, and (3) I'laic . 
capture and .sandwlcii probe liybrlJl^yiliiirt (Mul- 
der ert al. 1994). Although tilCSC HIVthtKiN jmivrd 
StiCCCSSful. each metlKJcl rt»qulrc!{ postO*CR ma- 
nipulations that acid Thii^ Ui the aiialyMs aitO . 
may lead (u Jwbufatwiy tunlrtuiiMiation. The 
sample ihfuugliput uf Ihe^e jnrlli<iU> i.> limited 
(wiUi the exception of the plate capture «p- 
proiK'h). «n<l, llutr^forc^ thoc methods, ore not 
well >uHc:d fiu uSc^ dciHandjji^ bigh sample 
Throughput (I.e., acreeithiK ^*y^H^ nujiibers of 

bU.*iluilc:v.iAlv» \n <xiia.lyKlit{^ *^Ainj/tva Cut dia^ilc»- 
tlc^ or clinical IriaKs), 

More wc r^Tjwrt th<: dcve.lopiiuMit of a novel 
itiftxay for quantitative DNA anA]y:il.i. Tlie astsay is 
ImstKl on t)i^ usr. of iht! 5* nueU*asv artxay first 
described by Holland el al, (1991), Die luethod 
u,w i!i<! 5' niiclca.ic. activity of 7Vi</ f>t>lynicra.nc to 
dcavc a nonc^ctcndlblc hybridization probe dur- 

r>ic ^tension phase *7f. l'CK,T>u! «pproa<:h 
uses dual-labclcd fluorogenic hybridi/.atlon 
probes (Lcc ct n]. 1993; J^oaslcr ct «1. 1993; iJvok 
ct ttU J^9rio,b), One fluoreaccnV dyv .^crv^j n> a 
reporter IFAM (i*c., 6-carbQ3«yfluorei^'cin)( and il5 
emission spectra is quenched by the second fluo- 
rescent dye, TAMRA (I.e., rj-carlxixy-ieifamcthyl- 
rhodaminc). Tlic niiclcase degradation of the by- 
hrldlxjitlon prohu relt'a^^es the quenciiiug of the 
VAM nuorcscciit etinssUui, fcsultii'tg )n an In- 
crease hi |M:ak fluorescent emission at 5Jtt mn^ 
Hie use of a se<|ucncc detector (ADI Prism) allows 
mcasurcmeni of fiuoresctint ,%|H:clra of all 96 wclh 
uf rh« tncrmai cycler continuously during the 
1X:K anjpiUlcatlon, •nu:n:ftjrc, ilie n:uetiuii» ttjc 
ifunilloreU ui real luiie* 'll^e output data is de- 
scribed and qu^ijninitlYc lumlysb uf hiput tdr^^et 
DNA sequences u discussed below. 



RESULTS 



PGR Product perccrlon in R«al Time 

'llic goal was to develop a higlvlhrotighput, son- 
jciiivv, nn<l nttt-uratc gene quani Nation asKay for 
U5c In monitoring lipid mediated tbftfapcuTic 
gene delivery, A plasinld Amending human factor 
VIII gene «?quftnce, p!?8TM (sec Mc*thods). w;is 
uscti ai a ukkWI ih<:ra|>eutie gwne The assay usr^ 
fluorescent Taqman mothodoJogy and an instru- 
ment uijjable of measuring nuorc.%'ena» in tcaI 
tunc (ABl Prisni. 7700 Sequent neicrlnr). Hu! 
'•|>tqMi;if« rojicilon rcqum*s a hybrldlzailon probr 
lalKled witli two different fluorescent dyes. One 
dye li a report dy« (I«AM>, the other is :r quench- 
hig dye (TAMRA). When the pruU: \s lolacl, fluo- 
lesccnl energy transfer occurs and the reporter 
dye fluorc*c.«tii emission Is absorbed by the 
quenching dye (TAMRA). During Die cxlctision 
phase of the PCK cycle, the Hiiorcscenl hybrid- 
tyjilloii prtilTc l< dcaved by tJie S'-.T nudeolyiic 
adlvhy of tlie T>NA polymerase. On cleavage of 
Ihc probe, the reportcf dye emission is no longer 
Iraanfcrrcd efficiently to ihc quenching dye, re 
suiting In an Jnerea,^e of the roportor c\y^^ fluorct- 
cent iTiiii.'ciloii t»j>ectf<i. VCfK primers and prubcH 
were dcMgninl f<ii lliu huinao faclof VllJ se- 
quence and human p-actin gene (a.% dtyicnbed in 
Methods). Optimization reactions were per- 
formed to choose the approprlute probe uml 
magnesium concentrations yielding ihe bigbe^t 
Ihteiviity'of re|5ortcr fluorescent signal without 
saerlflcing speclfKity. The in.Mrument uses a 
charKc-couplcd device (i,c«, CCD camera) for 
mca."siirlng ihe flu6rc:iccnt einiavsion speetni from 
ftlH} to C»$0 nm. ICach IK:u talx: was monitored 
s(,ijuentiftUy for 25 to.hcc whh c^mUnuous monl- 
toHng throughout tlit: aiuplifaeutitm. Uacli tube 
wa.n re-exanilncrd every tt*5 »ec. Computer soft- 
ware Y/an dc.'signed to exttinijir tlie fluorescent In- 
tensity of both the reporter dye (FAM).and 
ihe quenching dye CFAMRA). 'hic Ituore-^crtt 
intensity of the quendilng dye, TAMRA, cliangvs 
very Utile; over the course of the PCR anipUfl* 
cation (data not shown). 'Hierftforc, the Intensity 
*>f TAMRA Uye emission serv«* as an Internal 
xtandard with which to nottnulliwi the reporter 
dye. (FAM) emission variatjoits. Tl^e software eul- 
euloto u value termc*d AKn (or ARQ) using the. 
following equation: ARn - (lin^) (Rn"), where 
Kn-* • • efnl»»tUn iuicjisiiy %>t reiwrier/cmission in- 
tensity o^ quencher at any given lime In a reac 
tloii tube, and Rn r- emission intenaitily of re- 
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poncr/cniis5l07) ImvxiMiy i>f qucucWer mcflMircd 
prior 10 I'CH iimpniicaliou in ihar same n'iiaioa 
tuJK. I'or the purpose of quaiituati<ni, Uiv U^i 
tlirec data jioinis (ARn$) coUct-n:U during rrx*. ex- 
tiMisiOii step for eacti Vi.'M cyciu were analyzed. 
TJic oucleolyiic dcgradatum of the Jiyumliy-iiion. 
probe occurs during ihc cxtcnwuM phas^or ftat, 
and, thifreforc, rq)Oficr fluorescent ciiiuMun In- 
creases during xhU lime, Jiii: iluw daw pointn 
were averaged for cacli k:K cycle and the ujcHn 
value f<.ir cacli v/iis plotttid in an "axiipmicatJon 
plot" shown In ll^ure 1 A. Tlic ARn mean valwv Is 
ploUecJ on ihe j'-axis, and time, represented liy 
cycle numbtr, is plotted on thv^-axts. DtirlnRthe 
earJy cyclej; of the PCU ampIlfStation, tlie ARn 



value remains at ba$«if Jlno Whrn Mjfficleni hv- 
biidl'/MUm \wo\h: hfts hoen cleaved by Uju Tiuj 
. |>olymera«: nuHeiAAO activity, tli« ialeusily of ro- 
; pcjrlt-r fjucirc-tccnt cmUiafrm infirvawt*. Most iK'l\\ 
anipHrK4irionfi reach » plateau pha Re of reporter 
fiuuTCH-vi iV emission if the rcMcluui Is carried oui 
. lo high C7ek* uuml^ciN. The a/njilifiraUon plot \'J 
uxauihicd vaily lj) Ihii. ffcaclion, at a point lhat 
ic-prcscnts ihc- log phi«!e of prtHlud arnimula* 
lion. This Is done by ussignlng an arbiUary 
flircshold tiiji in b«M:d on the varuibilUy of the 
bas«.linc dMia. In Figure 1 A, tlic Ihfftshold was set 
ai JO standard duvwlioiiK aUivc the meJin f)f 
Via^e lln« eniis^Kiii tialculated froiu <.7^1^ 1 lo 1 S. 
Once the threshold is chosen, the point at which 
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ndure 1 PCR product detection in real «nne. (A) The Model 7700 software wHl consiruct amplificalion pleti 
from the extension phase fluorescent emission data colleaed during the PCR ampllflcaUon. Tl,e standard de- 
viation is determined from the data points collected from the base fine of the wrtpjlflcauon ploL vaiu^ are 
calculated by determining the poini ai which the fluorescence exceeds a threshold llmil (usually lo times toe 
standard deviation of the base line), (S) Overiay of amplification plots of senally (1:2) dUutfidliiuman g^^omic 
Ol^ wmolcs amplified with p-actin primers. (0 Input DNA concentration of the samples plotted versus u-f. /yi 
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the ainplificot ion plot crofiscft the thrc!thp1d'«vcl& 
fined as C,, Is rc{x>r(cd aft the cycle number u\ 
till?! ]K>inl. Ar will be demonstruttui^ lh« (V, .value 
I9 jiieUicLive of the quantity oi input tiir^ttt. 



VdJues Provld« a Quantitative Mea:turcmcnr< of 
ln|>ur Targer Sequencer 

. Plgurc IR shows amplificsittun pjais of lifd]>T<$K. 
eni rCR diMpJificaUons t>vtrM6. iTie omplt^ou- 
tions were porforj^ed on a 1:2 scilal dllutkira -caj: 
human genomic JWA. 'Ilic an\pHftcd targel wju 
human ^ octin* The amplification plohi Klnfl (o 
the right (to higher threshold cycles) n« the input 
lAfgol quantity \Si reduced. 'Jhij i^; expectod he^ 
(muftu r«mt:tlofiK with h\wvT 5;tnrtin^ copids' of tllO 
largct moloailc require grcrucr amplification to 
degrade enough probe to attain the Threshold 
fluorescence* An arbitrary threshold of 10 st^ri^ 
dard deviations above the base line was used to 
detennhie tlie C'-,- values. Figure IC reprcswits the 
Cly valuc^f plotted versus the $Hinple dilutjoti 
vaJuo. Each dilution was amplified in trlpiicalc 
P<:k amplifications and plotted a.s moan vahies 
with error bars representing one standard devia- 
tion. Tlic C r vtthicS dcxreasc linearly with Increas- 
ing target quantity. ThU5, Ci- vahurs can used 
as 0 quantiraTive measurement of the Input target 
numl7e.r. It should be noted that the amplifica- 
tion plot for the ]5«6*n^ samj^le shown In lUgurc 
Ifl does not reflect tht* same fluorescent rate of 
Increase exhibited by most of I he other samples, 
•me I5,6-ng sample also achieveK r.nd|'>olnt pla- 
teau at a lower fluorescent value than would he 
cxiKCtcd haM:d on the Input PNA. 'ilii.s pheiu)7TU 
cnbn has been oljscrvcd. occasionally wiili other 
sampler (data not shown) and may be aitrilmt- 
able to Itttft cycle inhibition; this hypculicsi.^ is 
still under hivcstigalion. It is important to note 
that the flattened sloj^e and early plateau do not 
irapaa signinc^ntJy the calailated C, value as 
demonstrated by the fli on Ihe line shown in 
Rj4ur« 1 C, Ail triplicate am|?lincaiic>n?i ntsulted in 
very similar Cr values— Uic standard deviation 
did ncit exceed 0,5 for any dlJutlon* IJiU cxiK'ri- 
ment contains a > 1 OO^OOO-foW range of Input tar- 
get molectdes. Using Cy values for quaniUation 
permits a much larger assay range than directly 
using total fluorescent emission intensity for 
. quantitation. Tlie linear range.ol lluorcsccnl in- 
tensity measurcmfcrtt of ihc ABt rrlain 7700 Se- 



mont« over a very larfje t;«ij»i» «f rM:itlvr> ct;«rtlnp/ 
largM quantities. 

Sample Freparadon Validation 

Several parameters Influence the <-fUcU*"ry nf 
PC:r amplification; magnesium and salt concen.: 
trattoMS, reliction conditions (i.e., Iituc and Unn- 
pcrature), PClt target si7.c and composition, 
pnuicr sequences, and sample puriry. All of the 
above (actors are common to a :dnj|;lc VCR assay, 
except sample to sample purUy. in an effort la 
validate the. method of sample j)reparaUon for 
the factor Vlil assay, PCKampliliCOtion rcproOnr- 
ibility and olflclcncy ol 10 replicate sample 
pre}MratioTi.s: w»r« examined. After genomic ONA . 
waN j)repared from the 10 repllciatc samples, the 
DNA was quant Itatcd by ultraviolet spectroscopy. 
AmiHincallons were pcrfomied analyzing p-acUn 
X^'"^- ^«Titent In 100 ami 25 nj; of total >;un<w)iic 
DNA. Each K:K amplification was perfomicd in 
• triplicate. C«ainparisoii of Cl^ values for each trip- 
licate i^ample show minimal variation basird ofi 
standard deviation and coeflkient of varianct* 
(Table 1). iliercfore, each ol the triplicate PC:u 
amjjlifications was highly reproducible, dcfnon- 
strailng tliat real time PCK using this inslrumcn- 
tatlon introduces minimal variaiinn Into th« 
quantitative J'Cit analysis, C^jmiparisoii of the 
moan C'^ values of the 10 replicate sample prepa- 
rations alstj showed nihiimal variability, indicat- 
ing that each sample preparation yielded similar 
results for f^-aclin gene quantity, 'i'hc highest Cy 
difference between any of the Si^mples was 0.85 
and 0.73 for the 1(X) and 25 ng siimpU^s, rc^pcc- 
lively. Additionally^ the ampllflcaliou of cadi 
sample, exhihitttd an equivalent rate of nuorc.s> 
cent emission iiitcnsily change per amouni of 
DNA target analyzed as indicaied by similar 
sIojKS derived from I he sample dilutions (Fig, 2). 
Any sample containing an excess of a PCU inhibi- 
tor would exhil)it a greater measured p-acUn c;,. 
value for a given quaniliy of UNA. In addition, 
ihc inhibitor would be diluted along wiiJi lint 
.Nampic in the dilution analy,>i.N (Hig, Z), altering 
the expected Or value change. Each .sampje am- 
pJJficaticn yielded a similar result in th« analysis, 
dcmonstraiiiig that tliis method of sample prepa- 
ration is highly reproducible with regard to 
sample purity. 

Ouanciudve Analvsis of a Piasmid After 

7oen AQi aha wa «c:!fT 7nn7/cn/7T 
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Tsbia 1 , lloprodu<lbiUty of S«fftpl« Preparation Method 



100 ng 25 ng 



Sampla 
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no. 
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«C VMl 1,fVf 1 




7 


18.2^ 
18^3 








20.48 
20.55 












1«.27 


0.06 


0.32 


20.5 


20.51 


0.03 


0,17 


2 


18.33 
18.35 








20.61 
20.59 










18,44 




0.06 


0.3? 


20.41 


?0..^4 


0.11 


0.54 


3 


183 
18.3 








20.54 
20.6 










18«42 


18.34 


0.07 


0.36 


20,49 


20.54 


0.06 


0,28 


4 


18.15 
18;23 






- 


20.4,8 
20.44 










18,32 


18.23 


o.6^ . 


0.46 


20.38 


20.43 


0.05 


0.26 


5 


18.4 
18.38 








20.68 
20.87 










18.46 


18,42 


0.04 


0^3 


20,63 


20.73 


0.13 


0.61 


6 


18,54 
18.67 








21.09 
21.04 










19 


18.74 


0,24 


1.26 


21.04 


21,06 


0.03 


0.15 


• 7 


18^6 
18.36 








20.67 
20,73 










18^2 


18.39 


0.12 


0.66 


20.65 


20.68 


0-04 


0.2 


8 


18.45 
18,7 








20.98 
20.84 










18.73 


18.63 


0.16 


0.83 


20.75 


20.86 


0.12 


0.57 


9 


18.18 
18.34 








20.46 
20.54 










18.26 


18.29 


0.1 


0..4.^ 


20.48 


20J51 


0.07 


0.32 


10 


18.42 
18.57 








20.79 
20.78 








1 8.66 


18.55 


OA2 


0,65 


20.62 


20.73 


0,1 


0.46 


Mean 


■0 10) 


18.42 


0.17 


0,90 




20.66 


0.19 


0.94 



tor cx)iit;iining a parll4ii cUNA fur hujiKin factor . betw4i«ii :ifiy iwa samjilo moonie was 0.5)5 C,. Ten 

Vfll, pl-HTM, A $crk*S o( tK<nsfcction$ waA not nunogranisof lotulUNApf^ach sample were aiiso 

up using a <lccrca«ing amount of the plasuiid\40, cx.*)iiilri€:d for p-acUn. llic rcswlta «^ciju .showed 

4, 0.5, and 0,1 |jLg). 1\vt!niy«rour hours po.si- ih«l very siiiiUar aniounU of genomic 1>NA were 
tr<ifKir<;c'tion, local T5NA wi»i ptirlflrd from each . present; tiu: maxlmuni mean ]5 act)i> <";, value 

flask of wUa. (S*-Acluj ^wic qudu lily Wai cliux-i i « a diffcrcnct: wa.s 1 ,0. As l^'igurc 3 ahows, the rate of 

a value for normalihatH in of ^viivmic, DNA con- (^-actJn C|« uliuji^v Ix-twccn the 100 and 10-ng 

ccnrration iwiii wli aaiuplc. In llils c;ipciiiii<:nt, sajnx>lca was similar (^lo^yc vaJuci? rang« butwoon 
p-actin K^nc conicni should remain consiam 3,56 ana -3.45). Thla verifies again iha< ih'c 

relative to roral genomic DNA. Fljjurf 3 .^hows if jv niclho<l of .^lamplc prCfjaraUon yields sajjr»]>lt»s of 

result of the p-actln JDNA measunnnent <100 njj identical PCRintcgri^ (5,o-, no sampli^ cofU.iinod 

total DNA determined by ultraviolet spectros- an excessive amauut uf a PGR Inhibitor). TTftw* 

copy) ot each sample: Kach sample wa& analyzed ever, these results indicate that cacli sample con 

ill triplicat(j and the mean p-actin C:^ values of talned slight liiffeiences in the aciual amount of 

the triplicates were jjlotted (cnor bars represent genomic ONA aualy/xd. UctenninatJon of actvial 

<^ry^ rf'^^AtkMi fi*nn'M%rkn% 1 Hp tilphi*i5r ^lifformrp ^jeuoniic ON A s.oncent ration was accomplished 
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P^uns 2 S<3f i tple prepAraiion punty. 1 he reptlcotct 
samples shown In Tablo 1 woro <iU:o ampKfied \\\ 
tripicate vising 25 ng of each DNA sample. The fig- 
uie 5l*iow5 die input DNA concentration (100 and 
2S ng) vs. C, In ihi* \\ty\t^, ih<» 100 nnd ^5 ng 
polnis for each sample are connected by a line. 



I'jy plot ling the mean (^-uctio C, value obtained 
for uach IDO xi^^ ^Minpls; \n% a f»-«icthi st.-inil«ir(l 
curve (shown In JMh< 40). TIic eclunl {;cfiumic 
OMA concent r it < I each sumplcj tu was ob 
lalncd liy extrapolation U) ttiu X^axUi 

rii;urt: 4A shows tlic ixicaswrcU ((.tr., iitx**- 
normaU7««d) <f\i«iiallic:» uf /actor V]]) ploMnid 
ON A (pI^lTvl) ifrc^m each' of the four transient ccl] 
lr}f««^fectlr>n,v Each rcacUon contained 100 rtff of 
total 5<tnij>lc UNA (as determined by UV spcctroi^ 
cop/}. lUich sample wa.s uiialy2MrU in trSplicvtc 



.9ft 
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21 



20 



y27JS4<IMrrll-f 

y«'/.V2<r-S.4SxAr.t 




I09 (09 Inpul ONA) 

Figure 5 Anulybh of lidosfectcd cdi DNA qwonUly 
and purtty. I he DNA preparations of Ihc four 7^% 
cefl transfections (40, 4, 0.5, and 0,1 o^ pFSTM) 
were analyzed for the (3-actln gene* 1 00 and 10 ng 
(delermined by ultravtolel spectroscopy) of each 
sample were amplified in triplicate. For each 
amount of pF8TM thai was transfectcd, the ji-aciln 
Q values are plotted versus the total : Input DNA 



Rl Al flMl- OUANTITATIVH PCR 

•I>c:r< ;nnpllBcatk}r)9. As shown, pI'SlTvf piirificd 
ifxoic Jhc 203 colls decr^scj; (moan valuer in- 
CTU:t^i^; with decreasing, amounts <if plaicnild 
aruitsii-tHlcd. The me»ti values obtained for 
pW)*KI -in TigufC 4A were plotted on a standard 
curve comprlned uf s^'ilully diluted pKHTM, 
shown in figure 4R. 'Hie quaiiHIy oJ pl-KI M, /j, 
found in each of Uic four tranRfoctlour; wus de- 
termined by cxtrapoJatJon to the x axk uf the 
standard curve In l^i^urc 4B, 'Hutsc unairrectcd 
values, b, for pWl'M were noruiJ^ll^'Aid to dcicr- 
ininc tiic act u;*l amount of pl^l>^ fQaiiid lOf) 
riK of genomic DNA by uAing the equation:. 



/> X 10 0 nft 
if 



uctual pFfriTvl copies per 
100 ng <if genomic IwA 



where a actual |»ciiojnic DNA in u saffiplc and 
b ^ prft'lTvl copies from the standard curve. 'Hm: 
noi:inid»r«cd quantity of pCSl'M per 100 ng of ge- 
nomic DNA for each 0/ the four transfectlons Is . 
sfiown in Figure 4Ji, 'ilic^e rcr^uU.^ show thai the . 
quantity of factor VUl pla:>iulO iissivtlated wiili 
the cells, Z\ hr lifter ir«jusTv.:<.;lioit, di:i.iv:.iSi:.s 
with Ucc'rcctsluj; pltiMuiU vxjiH.tinLiatiou used hi 
the lfaii:»fcctio»u The quantity of pl'tJTM nsstwJ- 
atea with 293 cells, after irunsfcctlon with 40 ^ 
of pUiKniid, was 35 pg per 100 ng Rvnoiiilc I>NA. 
*niis results in -520 plasuiid copies per eell. 



DfSCUSSJON 

We liavc dcscribcci a new uieUuKi for quflnUlut- 
ing gene copy numbers using real-time analysis 
of FCR amplincatlonx. Real-time FCK \<> conij^ai- 
iblc with cJthtT of the two FCR (KT-PCR) ap- 
pru<tciie3: (1 ) quanlUativc conifJclitJvi: wiicrc an 
hit ei J lid cumpetliof for each target .sequence xa 
tisc-d for nomiahjsatlon (data not shown) or (2) 
quantiiaUvc comparadve l*OK ussJug a mijmuli^a- 
tion gene contained within the .^lample (I.e., p-ac- 
till) 0/ a "housekeeping" gene for iri-PCH, ff 
equal aaiounts of nucleic acid are analyxetl for 
each satnple and if the ampHflcatlun efficiency 
before quamltiitlve analysis identical for each 
sample, the Internal cuuliul (iiu«mcili#^HtlH>n gene 
or competltar) should give equal aiKnals fof. ali 
samples. 

The rcaJ-time PCU method (offers scveml ad- 
vantages over the other two methods (narrcntly 
employed (see the tntroducUon). Mrsl, the reaU 
time IHJR method is perfonncd in a doscd-tube 
system and. requires no pnst-PC^R manipulation 
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Fl0ur» 4 ati|intlt«(K/o HnnlyKic of pFSTM in transiccurd cclb. (.4) Amount of 
plasmid DN4A used for Ihc transfecilon ploxicd against tfii: iiKjuri C, vatue deter- 
")i2^J^'^?^^^^ rcmAln;*i£j hr alter Ironsfoction. (fl,Q Standard cun^rs of 
p^RTM *nd p-acUn, respccuVcly. pf BTM PNA </J) and gcr^orticc UNA (Q were 
dllutAd tAflally 1 :S before ^mpimc^>llon with the oppropnoic primcfi. The f*-actin 
standard curvo wak usod to norn><ili>o the results of /I to TOO ng of genomic DNA. 
(O) The amount of prSTM prc^eni per 100 ng of genomic DNA, 



of .^mplc. lliercforo, Iho poU'ntlii) for TCIt coji- 
<iimInatU>fi in the ]ah<)rMt<.iry is reduced bccaitNC 
amplified productR cam Iw ;*iiHl>r/.e<l and disposed 
of Without opening xiw ru;ixnion tubes. SecomI, 
Ihb methcKl MippoiU lh« umi 4*f a tiumi4i1i>'^lK>f] 
gene <Lc,, P-actin) for ^^uanihailve PCR orhouMr- 
kvephi^ genes for quiin(itotJvc Rl-l'CK controls. 
Analysis is performed in real time during the Jog 
phase of product accumulation. Analysis duflnR 
KfK phase pcnnits many different pnics fcver a 
wide input target range) to be analy^rd .ilmulU' 
iu!uusly, without concern of rcacliincL reaction 
plateau at different cyclo, Tim will make niuUl- 
$cn« analysb assays much cmMkm |m tJcvelop, l^c- 
cause individual intcrnci] i.unipc:tltui> will iu*l Ix: 
needed for each gene under analy»la/tliird, 
:»ainplc throughput will iui.ica>e dmntatUdJty 
with the new method hecauae there )» no |k>M. 
rCR proccwing, time. Additionally, wniking In a 
fnnnat h highly couipatihle with aut<>. 
uiatton technolo^, 

Hie rcai-tiuie 1>CR method 0^ highly repr**- 
ducible. Replicate, atnplifkations can be analysed 



for ^.ich sample nUniml;dni; j>otcnital <frr<>r. Ilui 
«sysiit»Tt ;ilU)ws i\>x a very large a^say dynamic 
ruijfto fapproacliing l,tKK},<K)0-f<tld Marting tai- 
g<fi), Ualn^ a .standord curve for th« t«rKCl ol in- 
tcrvst, relative cof)y nunibc:r values can be dcicr- 
mJncc! for any uakjiuwn -^anlpU^ HunrtJjctjni 
threshold values, C,v contdair. linearly with rela- 
tivc DNA copy number*. Keal time qu<;ntltativ« 
K'J^ rCH mcthodolfjgy (GIb>oii ct al., dils liwij<-) 
lia,i ahobecn d<ivc«lo|>cd. iMn^iUy, real iUnn qu^ii- 
tifativc PCIl methodology can Itc used lu deveJup 
high-throughput screening aaaay^ f«r a variety of 
api^licatlons fquantJtatlvc gene CAj'ieadiuii (RT- 
POK), f^cne copy a.i.^<iya (fieri, HIV, etc.), ^cmv 
^yp'^B (knockout mow^c analysis), and hnmuni>- 

rcuj, • 

Ucal-ti)n« PCAK may al.io lie |«:rfoTmcsl using 
intcrcttlaiing dyes (Htguci^i ct al- ^WA) sud^ as 
ciJiJdium bromide. The fluorogenic probe, 
method offers a mafor advantage (jver inter- 
ralaiing dyes—greatex specificity (Lc./ primer 
dimvrs and nonsptidfic PCR product."; are not de- 
tuned). 
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METHODS 

Generation of «i Plastnld Containing a P«rtlol 
cDNA for Human Factor YIll 

Totol RNA v<i» hiifvratcd (UNA*-^* U (nmi 'J'c«J Tcft, Utc,, 
fmit^YTOOdf T>i) frwiu o!Jl> li-»«»*fc<t«l a (actur VI H 
rK|irtMsluii y^ior, pC:|S2.lk-%5U (Katott vl u\. Octr* 
man d al. 1900), A CMClor Vlll fMrtlat cDNA ««of)U(*tuv WAS 

i^*fM«m«c«i toy irr t<:i««i«Amp la iTih ItNA 

(part Nttojwny*!, r£ Aiiplunl HHisy^ictns, l^»«ci <p«y, t:A)j 

ar^ shown below). Iln* ampllcoii w;^ reaniiilifH'tl iiMni; 
ntoUtficU l^Utt and Wricv iirimers (dpixwUnJ witli /<«wia 
and iz/ikIUI ivst/lctlon site sequences n« tliv *s' viuU uful 
clonal liilo ptilO^- 3Z (rronifj-u CUWfV, Mvdwou, Wl). Tlu» 
rcsuUtnKClnnr, pVSl'M. was u»c(i Icir transicni transfealon 
of 293 cdU; 



Ampltficaiion of Target DNA anU Duiccilpn or 
Amplicon Factor VHI Wasmid DNA 

- <|9Ffri'M) waj |i!ni»l»fluil Willi llic piimvii l^fuw 5'-<x;<:- 
<n'<KX;AAUAU:itJAtXMCn\V3' and I^J*r<.-v 5 -AAAc:<rr- 

t;AC5cxrrcKjAix«rrAOc-3'.Tii«j jv«ciu»ji iMoJuwU h 

K:K |jfOduci. Wc forwartl jariiiKT was ilv>l);iivd tu ux* 

r«};)un of i\w |kiiciU (XJlS2«ttv251> iilaMnKI anxl ilwAciotc 
(kivs H'vii|(fitM: ain|>llf/ Uk* human fncdir VIII 
g^TiVr I'TMiiafft woro chOKon ivitli Ui« av«ft;iii(t* iIk< itinu 
iHilcr proKraiu OIii;<» (Nufimnil liiusclonvcs, Invn I'ly- 
mouth, MN), The human p-actl« g eiu- wmx aini>Ilfled with 
Ihc prlxuco (5-tt* !iti AirwAal primer TCACClCAC:A<rr(:T 
GCCCATCrrAClOA-^* and ^-actin icvcrS<- piio^cr V.( :Af J. 

C00AACCC;f:i<:ArK;(:(:AA*j*CG-3'. The reacilon pro- 
ouctKi a EV5 np I'Cutprcxlutt. 

AmplificaUon reactions (50 (U) coniHincil a UNA 
sample. JOx Buffrx ll jxl), 200 iLM ilAlT, JCir, 
dcnv and /i(K) i^m riirri», 4 mxf Mg<:iy, iiniix AmpJi 
7Vi(; r;NA poiymcia«c, 0.5 unit Ampi'jrasc uracU /^-jjly- 
MMylujH- <UHO)/ 60 pmok* of each fncloi VIII |irliuvt, iind 1£ 
piii<4<* <if iitti^li |4 9icOr> ptUno^ 71)0 «<«Q('tluiu mIm» i»»iilttlnc<J 
On« of the foMowlit^; ^rkrtlnii |irolM*.H (HMl nM cprh): 

j/(9jirt.»i>r .s'(WAM)Ac:crrcrrc:fiM:<rr<!f.-iT<rrm:roT- 

GCCTr(TAKiRA>p 3' «u(i p-avliji proU- 5' (TAM)ATGCXX:- 
XCrAMKA)CCCCC:ATCCCATC|V.l' where p indicates 
ph^MphofylaUon and X Indicates a Unkcr ami nucleotide. 
RcacUoii IuIht^ w<tl- Mi«»t?Amp Optical 1\(bcs (part AtJin- 
bcrNKOI 09X\ IVsrkjn USnwO thai %vOfvfroM(Hl (Ml IH'rkIn 
lUmcr) tv i'mvvciiI liglil /rom /cflccthi^* Tube cap» w«jre 
similar in K4K*r«»An'ip i^pa iHit specially designed to prc- 
Ycin U|;hl »(»iittvrHi5..AII ol IM* tHU< i;u#ii«Mf«)M>U*« wcry 
l«hca hy PK Applied Il90«/M«'ni» <|Vt«U«r OHy, CA) cxecpt 
Ihc facKn* VIII prituvn, which w«*it* synthcslxrcl at Ccfivfi 
lech, luc. (Scnith 5<m« rranclsco, CA), Pcolicv ww dc^%n<.'d 
u.\lu$ the Oligo 4.0 .Wi vrnrc^ . foUvwliiK gutdelliics mik- 
2(CSied In mc Mcnlcl 7700 .sequence l>CKX'U»f histtuuienl 
niunuat Htlcny,. probe ^iimiU he at least 5*'C J)l|Shrr 
than fht* anni'alUiK l«ui|A*i<iUfrc used during; thrrmal ey- 
rhtig; primers sho\dd not Xmhm M«*hk* dupiexefi' wiih Ihc 
pfohe. 

The tUeniJ4il t'ycHng eundilKiii.H Included 1 tuUi ai 
^O't^ and 10 mU% at 95"C. Ilichiial cydulg prnrcrded with 
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n^ctiom were pcrfoniied imh<* Modol 7701) .Scqucna* IV- 
tcniiir (PU AppHe<] Ulojsyvivuiv), vjU\cU cuiitxhis 'a Ociw - 
Ani|» WiK Systum V<>0(X ktactlon wudHion^ w^-rr- pr<i» 
gruilOllcU oil *t K*w«r Macititonh '/10(1 (Apple CV,inipiilnr, 

.Santa Qara, C^v) linked diivcMy lo the Model WOO Si*- 
quell w iXiteclor* Atia9y«l« ♦►f data nlvu nerf^»rm#'H *>n 
the )w(.»flntf»«h contpMtcr. f V>llmHlo» and analydfi «>fi ware 
wits dovvktpMf at It: Applied nto«ystuitis. 

Transfection of Cells with Faaor VIH Construct 

]UmT ri 75 fUsks of 293 cells {ATCX: CIRHS?:*), a human 
fetal kidney suspeofcion cell line, wvrc gmwi (o 800(» con- 
Ili4cncy aftd tranifcctvd plVI*M. CX'IU wore Rrowii in tlu» 
tollowhtg incdltt! SC)% HAM'X HI 2 withotil GHT. ^tVJT* Iduj 
lilucose J^itJlKC<.N»'5 fmjdiflcd Koxic nicdiuio (I JMliM) wlUi« 
oin glyiific wiUi sodium bicarNwiatc, 10% Jctal Uivinc 
scruiri, 2 him L^liitauiinc, and 1% pcnidUiii-slrcptomy- 
iln, 'llic media was dianfjcd 30 mlp M^*'*' «he lran={cc 
lion. pl-OTM DNA amounts of 40, 4, OS, and 0.1 h? wvre 
aUdctl io ml of « $olutJt"Vconta)nlnR 0.125 m C:uap,- 
and 1 X IU'4'U5. Tltc four rtiiKhm-s were left al room ttMu- 
pcmtijrt* f<M lO min aa%<l thcti «#d*k-*l d«i|iwLM' i<> dw caWa. 
'Hiv nw*k> wviv hi^ulxilcd al 37°C: an*i 5.^* < f<ir 24 hr. 
washed with PUS. jit id r<L««i.^pcndcd In PltS. 'Hie K'mia 
jAMuUtd cells wre divided into itliqMoin and \)UA was O'- 
tTHCted liTiniedlatcly usiiiK the QIAantp Winwl Kil (Qiaj-en. 
aj«t,imnlli, CA). HNA w<i.s dwied Into 200 fi^l ol 30 tuu 
IVMICJ alpll«.0. . 
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ABSTRACT Wnt family members are critical to many 
developmental processes, and components of the Wnt signal- 
ing pathway have been linked to tumorigenesis in familial and 
sporadic colon carcinomas. Here we report the identification 
of two genes, WISP-l and WISP-l, that are up*regulated in the 
mouse mammary epithelial cell line C57MG transformed by 
Wnt-1, but not by Wnt-4, Together with a third related gene, 
WISP-J, these proteins define a subfamily of the connective 
tissue growth factor family. Two distinct systems demon- 
strated WISP induction to be associated with the expression of 
Wnt-1, These included (/) C57MG cells infected with a Wnt-1 
retroviral vector or expressing Wnt-1 under the control of a 
tetracylihe repressible promoter, and (ii) Wnt-1 transgenic 
mice. The WISP-l gene was localized to human chromosome 
8q24.1-8q24j. WISP-l genomic DNA was amplified in colon 
cancer cell lines and in human colon tumors and its RNA 
overexpr«ssed (2- to >30-fold) in 84% of the tumors examined 
compared with patient-matched normal mucosa. WISP-3 
mapped to chromosome 6q22-6q23 and also was overex- 
pressed (4- to > 40-fold) in 63% of the colon tumors analyzed. 
In contrast, WISP-l mapped to human chromosome 20ql2- 
20ql3 and its DNA was amplified, but RNA expression was 
reduced (2- to >30-fold) in 79% of the tumors. These results 
suggest that the WISP genes may be downstream of Wnt-1 
signaling and that aberrant levels of WISP expression in colon 
cancer may play a role in colon tumorigenesis. 

Wnt-1 is a member of an expanding family of cysteine-rich, 
glycosylated signaling proteins that mediate diverse develop- 
mental processes such as the control of cell proliferation, 
adhesion, cell polarity, and the establishment of cell fates (1, 
2), Wnt-1 originally was identified as an oncogene activated by 
the insertion of mouse mammary tumor virus in virus-induced 
mammary adenocarcinomas (3, 4). Although Wnt-l is not 
expressed in the normal mammary gland, expression of Wnt-1 
in transgenic mice causes mammary tumors (5). 

In mammalian cells, Wnt family members initiate signaling 
by binding to the seven-transmembrane spanning Frizzled 
receptors and recruiting the cytoplasmic protein Dishevelled 
(Dsh) to the cell membrane (1, 2, 6). Dsh then inhibits the 
kinase activity of the normally constitutively active glycogen 
synthase kinase-3p (GSK-3P) resulting in an increase in 
0-catenin levels. Stabilized /3-catenin interacts with the tran- 
scription factor TCF/Lefl, forming a complex that appears in 
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the nucleus and binds TCF/Lefl target DNA elements to 
activate transcription (7, 8). Other experiments suggest that 
the adenomatous polyposis coli (A PC) tumor suppressor gene 
also plays an important role in Wnt signaling by regulating 
0-catenin levels (9). APC is phosphorylated by CSK-3^, binds 
to 0-catenin, and facilitates its degradation. Mutations in 
either APC or 0-catenin have been associated with colon 
carcinomas and melanomas, suggesting these mutations con- 
tribute to the development of these types of cancer, implicating 
the Wnt pathway in tumorigenesis (1), 
' Although much has been learned about the Wnt signaling 
path>yay over the past several years, only a few of the tran- 
scriptionally activated downstream components activated by 
Wnt have been characterized. Those that have been described 
cannot account for all of the diverse functions attributed to 
Wnt signaling. Amopg the candidate Wnt target genes are 
those encoding the tiodal-related 3 geiie, Xnr3, a member of 
the transforming growth factor (TGF)-^ superfamily, and the 
homeobox genes, engrailed, goosecoid^ twin 0Ctwn% and siamois 
(2). A recent report also identifies c-myc as a target gene of the 
Wnt signaling pathway (10). 

To identify additional downstream genes in the Wnt signal- 
ing pathway that are relevant to the transformed cell pheno- 
type, we used a PCR-based cDNA subtraction strategy, sup- 
pression subtractive hybridization (SSH) (11), using RNA : 
isolated from C57MG mouse mammary epithelial cells and 
C57MG cells stably transformed by a Wnt-1 retrovirus. Over- 
expression of Wnt-1 in this cell line is sufficient to induce a 
partially transformed phenotype. characterized by elongated 
and refractile cells that lose contact inhibition and form a 
multilayered array (12, 13), We reasoned that genes differen- 
tially expressed between these two cell lines might contribute 
to the transformed phenotype. 

In this paper, we describe the cloning and characterization 
of two genes up-regulated in Wnl-1 transformed cells, WISP-l 
and mSP'2, and a third related gene, WISPS, The W75P genes 
are members of the CCN family of growth factors, which 
includes connective tissue growth factor (CTGF), Cyr61, and 
nov, a family not previously linked to Wnt signaling. 

MATERIALS AND METHODS 

SSH. SSH was performed by using the PCR-Select cDNA 
Subtraction Kit (CLONTECH). Tester double-stranded 

Abbreviations: TGF, traxisforming growth factor, CTGF. connective 

tissue growth factor, SSH, suppression subtractive hybridization; 

VWC, von Wiilebrand factor type C module. 

Data deposition: The sequences reported in this paper have been 

deposited in the Genbank database (accession nos. AF100777, 

AF100778, AF100779, AF100780, and AF100781). 

tTo whom reprint requests should be addressed, e-mail: diane^gene. 
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cDNA was synthesized from 2 ;ig of poly(A)"*' RNA isolated 
from the C57MG/Wnt-1 cell line and driver cDNA from 2 /xg 
of poly(A)"*' RNA from the parent C57MG cells. The sub- 
tracted cDNA library was subcloned into a pGEM-T vector for 
further analysis. 

cDNA Library Screening. Clones encoding full-length 
mouse WlSP-1 were isolated by screening a AgtlO mouse 
embryo cDNA library (CLONTECH) with a 70-bp probe from 
the original partial clone 568 sequence corresponding to amino 
acids 128-169. Clones encoding full-length human WISP-l 
were isolated by screening AgtlO lung and fetal kidney cDNA 
jibranes with the same probe at low stringency. Clones en- 
: coding full-length mouse and human WlSP-l were isolated by 
screening a C57MG/Wnt-1 or human fetal lung cDNA library 
.with a probe corresponding to nucleotides 1463-1512. Full- 
length cDNAs encoding WISP-B were cloned from human 
bone marrow and fetal kidney libraries. 

Expression of Human WISP RNA. PGR amplification of 
first-strand cDNA was performed with human Multiple Tissue 
cDNA panels (CLONTECH) and 300 /iM of each dNTP at 
94"C for 1 sec, 62'C for 30 sec, 7rC for 1 min, for 22-32 cydes. 
WISP and glyccra!dchyde-3-phosphate dehydrogenase primer 
sequences are available on request. 

In Situ Hybridization. ^^P-labeled sense and antisense ribo- 
probes were transcribed from an 897-bp PCR product corre- 
sponding to nucleotides 601-1440 of mouse WISP-l or a 
294-bp PCR product corresponding to nucleotides 82-375 of 
mouse M<^5f-2, All tissues were processed as described (40). 

Radiation Hybrid Mapping. Genomic DNA from each 
hybrid in the Stanford G3 and Genebridge4 Radiation Hybrid 
Panels (Research Genetics, Hunisville» AL) arid human and 
hamster control DNAs were PCR-amplified, and the results 
were submitted to the Stanford or Massachusetts Institute of 
Technology web servers. 

Ceil Lines, Tumors, and Mucosa Specimens. Tissue speci- 
mens were obtained from the Department of Pathology (Uni- 
versity of Pittsburgh) for patients undergoing colon resection 
and from the University of Leeds, United Kingdom. Genomic 
DNA was isolated (Qiagen) from the pooled blood of 10 
normal human donors, surgical specimens, and the following 
ATCC human cell lines: SW480, COLO 320DM, HT-29, 
WiDr, and SW403 (colon adenocarcinomas), SW620 (lymph 
node metastasis, colon adenocarcinoma), HCT 116 (colon 
carcinoma), SK-CO-1 (colon adenocarcinoma, ascites), and 
HM7 (a variant of ATCC colon adenocarcinoma cell line LS 
174T), DNA concentration was determined by using Hoechst 
dye 33258 intercalation f luorimetry. Total RNA was prepared 
by homogenization in 7 M GuSCN followed by centrifugation 
over GsQ cushions or prepared by using RNAzol. 

Gene Amplification and RNA Expression Analysis. Relative 
gene ampli^cation and RNA expression of WlSPs and c-myc in 
the cell lines, colorectal tumors, and normal mucosa were 
determined quantitative PCR. Gene-specific primers and 
fluorogeiiic probes (sequences available on request) were 
designed and used to amplify and quantitate the genes. The 
relative gene copy number was derived by using the formula 
2(^} where ACt represents the difference in amplification 
cycles required to detect the WISP genes in peripheral blood 
lymphocyte DNA compared with colon tumor DNA or colon 
tumor RNA compared with normal mucosal RNA. The 
a-method was used for calculation of the SE of the gene copy 
number or RNA expression level. The If /SP-specific signal was 
normalized to that of the glyceraldehyde-3-phosphate dehy- 
drogenase housekeeping gene. All TaqMan assay reagents 
were obtained from Perkin-Ehner Applied Biosystems. 

RESULTS 

Isolation of WISP-lznA WISP-2 by SSH, To identify Wnt- 
l-inducible genes, we used the technique of SSH using the 
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mouse mammary epithelial cell line C57MG and C57MG cells 
that stably express Wnt-1 (11). Candidate differentially ex- 
pressed cDNAs (1,384 total) were sequenced. Thirty-nine 
percent of the sequences matched known genes or homo- 
logues, 32% matched expressed sequence tags, and 29% had 
no match. To confirm that the transcript was differentially 
expressed, semiquantitative reverse transcription-PCR and 
Northern analysis were performed by using mRNA from the 
C57MG and C57MG/Wnt-1 cells. 

Two of the cDNAs, WISP-l and WISP-2, were differentially, 
expressed, being induced in the C57MG/ Wnt-1 cell line, but 
not in the parent C57MG cells or C57MG cells overexpressing 
Wnt-4 (Fig. 1 A and B). Wnt-4, unlike Wnt-1, does not induce 
the morphological transformation of C57MG cells and has no 
effect on p-catenin levels (13, 14). Expression of WISP-l was 
up-regulated approximately 3-foid in the C57MG/Wnt-1 cell 
line and WlSP-2 by approximately 5-fold by both Northern 
analysis and reverse transcription-PCR. 

An independent, but similar, system was used to examine 
WISP expression after Wnt-1 induction. C57MG cells express- 
ing the Wnt-1 gene under the control of a tetracycline- 
repressible promoter produce low amounts of Wnt-1 in the 
repressed state but show a strong induction of Wnt-l mRNA 
and protein within 24 hr after tetracycline removal (8). The 
levels of Wnt-1 and WISP ^ A isolated from these cells at 
various times after tetracycline removal were assessed by 
quantitative PCR. Strong induction of Wnt-1 mRNA was seen 
as early as 10 hr after tetracycline removal. Induction of WISP 
mRNA (2- to 6-fold) was seen at 48 and 72 hr (data not shown). 
These data support our previous observations that show that 
WISP induction is correlated with Wnt-1 expression. Because 
the induction is slow, occurring after approximately 48 hr, the 
induction of WlSPs may be an indirect response to Wnt-1 
signaling. 

cDNA clones of human WISP-l were isolated and the 
sequence compared with mouse WISP- 1, The cDNA sequences 
of mouse and human WlSP-1 were 1,766 and 2,830 bp in length, 
respectively, and encode proteins of 367. aa, with predicted 
relative molecular masses of ««40,000 (Mr 40 K). Both have 
hydrophobic N-terminal signal sequences, 38 conserved cys- 
teine residues, and four potential N-linked glycosylation sites 
and are 84% identical (Fig. 2A). 

Full-length cDNA clones of mouse and human WISP-2 were 
1,734 and 1,293 bp in length, respectively, and encode proteins 
of 251 and 250 aa, respectively, with predicted relative molec- 
ular masses of •*^27,000 (M^ 27 K) (Fig. 25). Mouse and human 
WISP-2 are 73% identical. Human WISP'2 has no potential . 
N-linked glycosylation sites, and mouse WISP'2 has one at 
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Fig. I. WlSP'l and W1SP 2 are induced by Wnt-1, but not Wnt-4, . 
expression in C57MG cells. Northern analysis of WISP-l {A) and 
mSP'2 (B) expression in C57MG, C57MG/Wnt-1, and C57MG/ 
Wnt-4 cells. Poly(A)* RNA (2 ^g) was subjected to Northern blot 
analysis and hybridized with a 70-bp mouse iVISP-1 -spedfic probe 
(amino acids 278-300) or a 190-bp »75/'-2-spccific probe (nucleotides 
1438-1627) in the 3' untranslated region. Blots were rehybridized with 
. human ^-actin probe. 
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Fig. 2. Encoded amino acid sequence alignment of mouse and 
human WTSP-l (A) and mouse and human WISP'2 {B). The potential 
signal sequence, insulin-like growth factor-binding protein (IGF-BP), 
\^C, thrombospondin (TSP), and C>terminal (CT) domains are 
underlined 

position 197. WlSP-l has 28 cysteine residues that are con- 
served among the 38 cysteines found in WISP-l. 

Identiflcation of WISP-S. To search for related proteins, we . 
screened expressed sequence tag (EST) databases with the 
WISP-1 protein sequence and identified several ESTs as 
potentially related sequences. We identified a homologous 
protein that we have called WISP-3. A full-length human 
WlSP-3 cDNA of 1371 bp was isolated corresponding to those 
ESTs that encode a 354-aa protein with a predicted molecular 
mass ojf 39,293. WISP-3 has two potential N-linked glycosyl- 
ation sites and 36 cysteine residues. An alignment of the three 
human WISP proteins shows that WISP-1 and WlSP-3 are the 
most similar (42% identity), whereas WISP-2 has 37% identity 
with WISP-1 and 32% identity with WISP-3 (Fig. 14). 

»75ft Are Homologous to the GTGF Family of Proteins. 
Human WISP-l, mSP'2, and WISP'3 are novel sequences; 
however, mouse WISP- J is the same as the recently identified 
Elml gene. Elml is expressed in low, but not high, metastatic 
mouse melanoma cells, and suppresses, the in vivo growth and 
metastatic potential of K-1735 mouse melanoma cells. (15). 
Human and mouse WISP'2 are homologous to the recently 
described rat gene, rCop-l (16). Significant homology (36- 
44%) was seen to the CCN family of growth factors. This family 
includes three members, CTGF, Cyr61, and the protoonco- 
gene nov. CTGF is a chemotactic and mitogenic factor for 
fibroblasts that is implicated in wound healing and fibrotic 
disorders and is induced by TGF-p (17), Cyr61 is an extracel- 
lular matrix signaling molecule that promotes cell adhesion, 
proliferation, migration, angiogenesis, and tumor growth (18, 
19). nov (nephroblastoma overexpressed) is an immediate 
early gene associated with quiescence and found altered in 
Wilms tumors (20). The proteins of the CCN family share 
functional, but not sequence, . similarity to Wnt-1. All are 
secreted, cysteine-rtch heparin binding glycoproteins that as- 
sociate with the cell surface and extracellular matrix. 

WISP proteins exhibit the modular architecture of the CCN 
family, characterized by four conserved cysteine-rich domains 
(Fig. 3^) (21). The N-terminal domain, which includes the first 
12 cysteine residues, contains a consensus sequence (OCGC- 
CXXQ conserved in most insulin-tike, growth factor (IGF)- 
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Fig. 3- (A) Encoded amino acid sequence alignment of human 
WISPs. The cysteine residues of WISP-1 and WISP-2 that are not 
present in WISPO are indicated with a dot. (B) Schematic represen- 
tation of the WISP proteins showing the domain structure and cysteine 
residues (vertical lines). The four cysteine residues in the VWC domain . 
. that are absent in WISP-3 are indicated with a dot. (C) Expression of 
WISP mR^A in human tissues. PCR was performed on human 
multiple-tissue cDNA panels (CLONTECH) from the indicated adult 
and fetal tissues. 

binding proteins (BP). This sequence is conserved in WISP-2 
and WISP-3, whereas WISP-1 has a glutamine in the third 
position instead of a glycine. CTGF recently has been shown 
to specifically bind IGF (22) and a truncated nov protein 
lacking the IGF-BP domain is oncogenic (23). The von Wil- 
lebrand factor type C module (VWC), also found in certain 
collagehs and mucins, covers the next 10 cysteine residues, and 
is thought to participate in protein complex formation and 
oligomerization (24). The VWC domain of WiSP-3 differs 
from all CCN family members described previously, in that it 
contains only six of the 10 cysteine residues (Fig. 3 A and B), 
A short variable region follows the VWC domain. The third 
module, the thrombospondin (TSP) domain is involved in 
binding to sulfated glycoconjugates and contains six cysteine 
residues and a conserved WSxCSxxCG motif first identified in 
thrombospondin (25). The C-terminal (CT) module contain- 
ing the remaining 10 cysteines is thought to be involved in 
dimerization and receptor binding (26). The CT domain is 
present in all CCN family members described to date but is 
absent in WISP-2 (Fig. 3 A and B), The.existence of a putative . 
signal sequence and the absence of a transmembrane domain 
suggest that WISPs are secreted proteins; an observation 
supported by an analysis of their expression and secretion from 
mammalian cell and baculovirus cultures (data not shown). 

Expression of WISP mRNA in Human Tissues. Ussue- 
specific expression of human WISPs was characterized by- PCR 
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analysis on adult and fetal multiple tissue cDNA panels. 
WlSP-1 expression was seen in the adult heart, kidney, lung, 
pancreas, placenta, ovary, small intestine, and spleen (Fig. 3C). 
Little or no expression was delected in the brain, liver, skeletal 
muscle, colon, peripheral blood leukocytes, prostate, testis, or 
thymus. WlSP-2 had a more restricted tissue expression and 
was detected in adult skeletal muscle, colon, ovary, and fetal 
lung. Predominant expression of W!SP'3 was seen in adult 
kidney and testis and fetal kidney. Lower levels of WlSP-3 
expression were detected in placenta, ovary, prostate, and 
small intestine. 

In Situ Localizatiob of WISP-l and WISP-2. Expression of 
WISP- 1 and W75/'-2 was assessed by m situ hybridization in 
mammary tumors from Wnt-1 transgenic mice. Strong expres- 
sion of WlSP'l was observed in stromal fibroblasts lying within 
the fibrovascular tumor stroma (Fig. 4 A^D). However, low- 
level WlSP'l expression also was observed focally within tumor 
cells (data not shown). No expression was observed in normal 
breast. Like WISP-l^ W!SP'2 expression also was seeii in the 
tumor stroma in breast tumors from Wnt-1 transgenic animals 
(Fig. 4 ErH), However, WISP-l expression in the stroma was 
in spindle-shaped cells adjacent to capillary, vessels, whereas 




Fig. 4. {A , C, £, and G) Representative hematoxylin/eosin^tained 
images from breast tumors in Wnt-l transgenic mice. The correspond- 
ing dark-field images showing WISP- 1 expression are shown in B and 
D. The tumor is a moderately well-KJifferentiatcd adenocarcinoma 
showing evidence of adenoid cystic change. At low power {A and B), 
expression of WlSP-i is seen in the delicate branching fibrovascular 
tumor stroma (arrowhead). At higher magnification, expression is seen 
in the stromal(s) fibroblasts (C and O), and tumor cells are negative. 
Focal expression of WISP- 1, however, was observed in tumor cells in 
some areas. Images of WISP-l expression are shown in E~H. At low 
power (£ and f), expression of WISP-2 is seen in cells lying within the 
fibrovascular tumor stroma. At higher magnification, these cells 
appeared to be adjacent to capillary vessels whereas tumor ceils are 
negative (G and K), 
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the predominant cell type expressing WISP-l was the stromal 
fibroblasts. 

Chromosome Localization of the WISP Genes, The chro- 
mosomal location of the human WISP genes was determined 
by radiation hybrid mapping panels. WISP-J is approximately . 
3.48 cR from the meiotic marker AFM259xc5 (logarithm of 
odds (lod) score 16.31} on chromosome 8q24.1 to 8q24,3, in the 
same region as the human locus of the novH family member 
(27) and roughly 4 Mbs distal to c-myc (28), Preliminary fme 
mapping indicates that WISP-l is located near D8S1712 STS, 
WISP'2 is linked to the marker SHGC-33922 (lod = 1,000) on 
chromosome 20ql2-^0ql3.L Human WISP-3 mapped to chro- 
mosome 6q22-6q23 and is linked to the marker AFM211ze5 
(lod = 1,000). WISP-3 is approximately 18 Mbs proximal to 
CTGF and 23 Mbs proximal to the human cellular oncogene 
MYB (27, 29). 

Amplification and Aberrant Expression otWISPs in Human 
Colon Tumors. Amplification of protooncogenes is seen in . 
many human tumors and has etiological and prognostic sig- 
nificance. For example, in a variety of tumor types, c-myc 
amplification has been associated with malignant progression, 
and poor prognosis (30). Because WlSP-I resides in the same 
general chromosomal location (8q24) as c-myc, wis asked 
whether it was a target of gene amplification, and, if so, 
whether this amplification was independent of the c-myc locus. 
Genomic DNA from human colon cancer cell lines was 
assessed by qtiantitative PGR and Southern blot analysis. (Fig. 
SA and B), Both methods detected similar degrees of WISP-1 
amplification. Most cell lines showed significant (2- to 4<foId) 
amplification, with the HT-29 and WiDr ceil lines demonstrat- 
ing an 8-fold increase. Significantly, the pattern of amplifica- 
tion observed did not correlate with that observed for c-myc, 
indicating that the c-myc gene is not part of the amplicon that 
involves the WISP-1 locus. 

We next examined whether the WISP genes were amplified 
in a panel of 25 primary human colon adenocarcinomas. The 
relative WISP gene copy number in each colon tumor DNA 
was compared with pooled normal DNA from 10 donors by 
quantitative PGR (Fig. 6). The copy number of WlSP-1 and 
WISP-2 was significantly greater than one, approximately 
2-fold for WISP-1 in about 60% of the tumors and 2- to 4-fold 
for WISP-2 in 92% of the tumors {P < 0.001 for each). The 
copy number for WlSP-3 was indistinguishable from one {P = 
0.166), In addition, the copy number of WISP-2 was signifi- . 
cantly higher than that of WISP-l {P < 0.001), 

The levels of WS/' transcripts in RNA isolated from 19 
adenocarcinomas and their matched normal mucosa were 





Fig. 5. Amplification of WISP- 1 genomic DNA in colon cancer cell 
lines. {A) Amplification in cell line DNA was determined by quanti- 
tative PGR- (B) Southern blots containing genomic DNA (10 >tg) 
digested with EcoRl (WISP l) orXbal (c^myc) were hybridized with 
a 100-bp huTmn WTSP-1 probe (amino adds 186-219) or a human 
c-myc probe (located at bp 1901-2000), The WISP and myc genes arc 
detected in normal human genoniic DNA after a longer film exposure. 
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Fio. 6. Genomic ampUficadon of WISP genes in human colon 
tumors. The relative gene copy number of the WISP genes in 25 
adenocarcinomas was assayed by quantitative PCR, by comparing 
DNA from primary human tumors with pooled DNA from 10 healthy 
donors. The data are means ± SEM from one experiment done in. 
triplicate. The experiment was repeated at least three times. 

assessed by quantitative PCR (Fig. 7). The level of mSP-I 
RNA present in tumor tissue varied but was significantly 
increased (2- to >25-fold) in 84% (16/19) of the human colon 
tumors examined compared with normal adjacent mucosa. 
Four of 19 tumors showed greater than 10-fold overexpression. 
In contrast, in 79% (15/19) of the tumors examined, WISP-2 
RNA expression was significantly lower in the tumor than the 
mucosa. Similar to WfSP-1, WISPS RNA was overexpressed in 
63% (12/19) of the colon tumors compared with the norma! 
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Fig. 7. WISP RNA expression in primary human colon tumors 
relative to expression in normal mucosa from the same patient 
Expression of WISP mRNA in 19 adenocarcinomas was assayed by 
quantitative PCR. The Dukes stage of the tumor is. listed under the 
sample number. The data are means ± SEM from one experiment 
done in triplk^te. The experiment was repeated at least twice. 



mucosa. The amount of overexpression of WISP-3 ranged from 
4- to >40-fold. 



DISCUSSION 

One approach to understanding the molecular basis of cancer 
is to identify differences in gene expression between cancer 
ceils and normal cells. Strategies based on assumptions that 
steady-state mRNA levels will differ between normal and 
malignant cells have been used to clone differentially ex- 
pressed genes (31). We have used a PCR-based selection 
strategy, SSH, to identify genes selectively expressed in 
C37MG mouse mammary epithelial ceils transformed by 
Wnt-1. 

Three of the genes isolated, WISP-l, WISP-2, and WISP-3, 
are members of the CCN family of growth factors, which 
includes CTGF, Cyr61, and nov, a family not previously linked 
to Wnt signaling. 

Two independent experimental systems demonstrated that 
WISP induction was associated with the expression of Wnt-1. 
The first was C57MG cells infected with a Wnt-1 retroviral 
vector or C57MG cells expressing Wnt-1 under the control of 
a tetracyline-repressible promoter, and the second was in 
Wnt-1 transgenic mice, where breast tissue expresses Wnt-1, 
whereas normal breast tissue does not No WISP RNA expres- 
sion was detected in mammary tumors induced by polyoma 
virus middle T antigen (data not shown). These data suggest 
a link between Wnt-1 and WISPs in that in these two situations, 
WISP induction was correlated with Wnt-1 expression. 

It is not dear whether the WlSPs are directly or indirectly 
induced by the downstream components of the Wnt-1 signaling 
pathway (i.e., P-catenin-TCF-l/Lefl). The increased levels of 
WISP RNA were measured in Wnt-l-transformed cells, hours 
or days after Wnt-1 transformation. Thus, WISP expression 
could result from Wnt-1 signaling directly through ^-catenin 
transcription factor regulation or alternatively through Wnt-1 
signaling turning on a transcription factor, .which in turn 
regulates WISPs, 

The WISPs define an additional subfamily of the CCN family 
of growth factors. One striking difference observed in the 
protein sequence of WlSP-2 is the absence of a CT domain, 
which is present in CTGF, Cyr61, nov, WISP-1, and WlSP-3. 
This domain is thought to be involved in receptor binding and 
dimerization. Growth factors, such as TGF-j3, platelet-derived 
growth factor, and nerve growth factor, which contain a cystine 
knot motif exist as dimers (32). It is tempting to speculate that 
WlSP-1 and WISP-3 may exist as dimers, whereas WlSP-2 
exists as a monomer. If the CT domain is also important for 
receptor binding, WISP-2 may bind its receptor through a 
different region of the molecule than the other CCN family 
members. No specific receptors have been identified for CTGF 
or nov. A recent report has shown that integrin Oy^a serves as 
an adhesion receptor for Cyr6i (33). 

The strong expression of WISP-l and WISP'2 in cells lying 
within the fibrovascular tumor stroma in breast tumors from 
Wnt-1 transgenic animals is consistent with previous obser- 
vations that transcripts for the related CTGF gene are pri- 
marily expressed in the fibrous stroma of mammary tumors 
(34). Epithelial cells are thought to control the proliferation of 
connective tissue stroma in mammary tumors by a cascade of 
growth factor signals similar to that controlling connective 
. tissue formation during wound repair. It has been proposed 
that mammary tumor cells or inflammatory cells at the tumor 
interstitial interface secrete TGF-0i, which is the stimulus foir 
stromal proliferation (34). TGF-^l. is secreted by a large 
percentage of malignant breast tumors and may be one of the 
^owth factors that stimulates the production of CTGF and 
WISPs in the stroma. 

It was of interest thait WISP-I and WISP-l expression was 
observed in the stromal cells that surrounded the tumor cells 
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(epithelial cells) in the Wni-1 transgenic mouse sections of 
breast tissue. This finding suggests that paracrine signaling 
could occur in which the stromal cells could supply WISP- 1 and 
WISP-2 to regulate tumor cell growth on the WISP extracel- 
lular matrix. Stromal cell-derived factors in the extracellular 
matrix have been postulated to play a role in tumor cell 
migration and proliferation (35), The localization of WlSP-1 
and WISP'2 in the stromal cells of breast tumors supports this 
paracrine model. 

An analysts of WlSP-1 gene amplification and expression in 
human colon tumors showed a correlation between DNA 
amplification and overexpression, whereas oyerexpression of 
WISP'3 RNA was seen in the absence of DNA amplification. 
In contrast, W75/*-2 DNA was amplified in the colon tumors, 
but its m RNA. expression was significantly reduced in the 
majority of tumors compared with the expression in normal 
colonic mucosa from the same patient. The gene for human 
WlSP-ft was localized to chromosome 20ql2-20q 13, at a region 
frequehtly amplified and associated, with poor prognosis in 
node negative breast cancer and many colon cancers, suggest- 
ing the existence of one or more oncogenes at this locus 
(36-38). Because the center of the 20ql3 ampHcon has not yet 
been identified, it is possible that the apparent amplification 
observed for WISP'2 may be caused by another gene in this 
amplicon. 

. A recent manuscript on rCop-I, the rat orthologue of 
WlSP-2, deiscribes the loss of expression of this gene after cell 
transformation, suggesting it may be a negative regulator of . 
growth in cell lines (16). Although the mechanism by which 
WISP'2 RNA expression is down-regulated during malignant 
transformation is unknown, the reduced expression of WlSP-l 
in colon tumors and cell lines suggests that it may function as 
a tumor suppressor. These results show that the WISP genes 
are aberrantly expressed in colon cancer and suggest that their 
altered expression may confer selective growth advantage to 
the tumor. 

Members of the Wnt signaling pathway have been impli- 
cated in the pathogenesis of colon cancer, breast cancer, and 
melanoma, including the tumor suppressor gene adenomatous 
polyposis coli and p-catenin (39). Mutations in specific regions 
of either gene can cause the stabilization and accumulation of 
cytoplasmic 0-catenin, which presumably contributes to hu- 
man carcinogenesis through the activation of target genes such 
as the WISPs. Although the mechanism by which Wnt-1 
transforms cells and induces tumorigenesis is unknown, the 
identification of WlSPs as genes that may be regulated down- 
stream of Wnt-1 in C57MG cells suggests they could be 
important mediators of Wnt-1 transformation. The amplifica- 
tion and altered expression patterns of the WISPs in human 
colon tumors may indicate an important role for these genes 
in tumor development. . 
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methods. Peptides AENK or A£QK were dissolved in water, made isotonic with 
NaCl and diluted into RPMl growth medium. T-ccU- proliferation assays were 
done essentially as described'*^*. Briefly, after antigen pulsing (30p.gmr' 
TTCF) with tetrapeptides (l-2mgmr'), PBMCs or EBV-B cells were 
washed in PBS and fixed for 45 s in 0.05% gluuraldehyde. Glycine was added 
to a final concentration of 0.1 M and the cells were washed five times in RPMI 
1640 medium containing 1% PCS before co -culture with T-ccU clones in 
round-bottom 96-well microtitre plates/ After 48 h, the cultures were pulsed 
with 1 |iCi of 'H-thymidine and harvested for scintillation counting 16 h later. 
Piedigestion of native TTCF was done by incubating 200 ^g TTCF with 0.25 p.g 
pig.kidney legumain in 500 )l1 50 mM derate buffer. pH 5.5. for 1 h at 37 "C. 
Glycopeptide digestions. The peptides HIDKEEDI. HlDN(N-glucosamine) 
EEDI and HIDNESDI. which are based on the TTCF sequence, and 
QQQHLFGSKVTDCSGNFCLFR(KKK). which is based on human transferrin, 
were obtained by custom synthesis. The three C-terminal lysine residues were 
added to the natural sequence to aid solubility. The transferrin glycopeptide 
QQQHLFGShArrDCSGNFCLFR was prepared by tryptic (Promega) digestion 
of 5mg reduced carboxy-methybted human transferrin followed by 
concanavaiin A chromatography**. Glycopeptides corresponding to residues 
622-642 and 421-452 were isolated by reverse-phase HPLC and identified by 
mass spectrometry and N-terminal sequencing. The iyophilized transferrin- 
derived peptides were redissoKed in SO'mM sodium acetate, pH 5.5, 10 mM 
dithiothreitol, 20% methanol Digestions were performed for 3 h at 30 'C with 
5-50 mU ml'* pig kidney legumain or B-cell AEP. Products were analysed by 
HPLC or MALDI-TOF mass spectrometry using a matrix of lOmgmr' a- 
cyanocinnamicadd in 50% acetonitrile/0.1% TFA and a PerSeptive Biosystems 
Elite STR mass spectrometer set to linear or reflector mode. Internal standar- . 
dization was obtained with a matrix ion of 568.13 mass units. 
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• Fas ligand (FasL) is produced by activated T cells and natural 
killer cells and it induces apoptosis (progranuned cell death) in 
target cells through the deadi receptor Fas/Apol/CD95 (ref. 1). 
One important role of FasL and Fas is to mediate inunune- 
cytotonc killing of cells that are potentially harmhil to the 
organism^ such as virus-infected or tumour cells^ Here we 
report the discovery of a soluble decoy receptor, termed decoy 
receptor 3 (DcR3), that binds to FasL and inhibits FasL-induced 
apoptosis. The DcR3 gene was amplified in about half of 35 
primary lung and colon tumour!^ studied, and DcR3 messenger 
RNA was expressed in malignant tissue. Thus, certain tumours 
may escape FasL-dependent inunune-cytotoxic attack by expres- 
sing a decoy receptor that blocks FasL 

By searching expressed sequence tag (EST) databases, we identi- 
fied a set of related ESTs that showed homology to the tumour 
necrosis factor (TNF) receptor (TNFR) gene superfamlly'. Using 
the overlapping sequence, we isolated a previously unknown full- 
length complementary DNA from human fetal lung. We named the 
protein encoded by this cDNA decoy receptor 3 (DcR3). The cDNA 
encodes a 300-amino-acid polypeptide that resembles members of 
the TNFR family (Fig. la): the amino terminus contains a leader 
sequence, which is followed by four tandem cysteine-rich domains 
(CRDs). Like one other TNFR homologue, osteoprotegenn (OPG)', 
DcR3 lacks an apparent transmembrane sequence, which indicates 
that it may be a secreted, rather than a membrane -asscociated,. 
molecule. We expressed a recombinant, histidine -tagged form of 
PcR3 in mammalian cells; DcR3 was secreted into the cell culture 
medium, and migrated on polyacrylamide gels as a protein of 
relative molecular mass 35,000 (data not shown). DcR3 shares 
sequence identir/ in particular with OPG (31%) and TNFR2 
(29%), and has relatively less homology with Fas (17%)., All of 
the cysteines in the four CRDs of DcR3 and OPG are conserved; 
however, the carboxy- terminal portion of DcR3 is 101 residues 
shorter. 

We analysed expression of DcR3 mRNA in human tissues by 
northern blotting (Fig, lb). We detected a predominant l.2-kilobase 
transcript in fetal lung, brain, and liver, and in adult spleen, colon 
and lung. In addition, we observed relatively high DcR3 mRNA 
expression in the human colon carcinoma cell line SW480; 

To investigate potential ligand interactions of DcR3. we generated 
a recombinant, Fc-tagged DcR3 protein. We tested binding of 
DcR3-Fc to -human 293 cells transfected with individual TNF- 
family ligands, which are expressed as type 2 transmembrane 
proteins (these transmembrane proteins have their N termini in 
the cytosol). DcR3-Fc showed a significant increase in binding to 
cells transfected with FasL* (Fig. 2a), but not to cells transfected with 
TNF*. Apo2L^^RAIL*■^ Apo3UTWEAK':''. or OPGL/TRANCE/ 
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RANKL'**'" (data not shown). DcR3-Fc immunoprecipitated shed 
FasL from f asL-transfected 293 cells (Fig. 2b) and purified soluble 
FasL (Fig. 2c), as did the Fc-tagged ectodomain of Fas but not 
TNFRl. Gel-filtration chromatography showed that DcR3-Fc and 
soluble FasL formed a stable complex (Fig. 2d). Equilibrium 
analysis indicated that DcR3-Fc and Fas- Fc bound to soluble 
FasL with a comparable affinity (Kj = 0.8 ± 0:2 and 
llliO.lnM, respectively; Fig. 2e), and that DcR3-Fc could 
block ncariy all of the binding of soluble FasL to Fas-Fc (Fig., ,2c, 
inset). Thus, DcR3 competes with Fas for binding to FasL. 

To deterniine ydiether binding of DcR3 inhibits FasL activity, we 
tested the effect of DcR3-Fc on apoptosis induction by soluble 
FasL in Jurkat T leukaemia cells, which express Fas (Fig. 3a), DcR3- . 
Fc and Fas-Fc blocked soluble- FasL-induced apoptosis in a 
similar dose-dependent manner, with half-maximal inhibition at 
--0,1 ji^ml"*. Time-course analysis showed that the inhibition did 
not merely delay cell death, but rather persisted for at least 24 hours 
(Figi 3b). We also tested the effect of DcR3-Fc on artivation- 
induced cell death (AICD) of mature T lymphocytes, a FasL- 
dependent process'. Consistent with previous results", activation 
of interleukin-2rstimulated CD4-positive T cells with anti-CD3 
antibody increased the level of apoptosis twofold, and Fas-Fc 
blocked this effect substantially (Fig. 3c); DcR3-Fc blocked the 



DcR3 
OPG 



1 (hIr AfDE G pgISs lTlIc lvlalpall pVFpIa V 1 

I ttljN KlUL C C AUJv FLLJD ISIKWTTQETFtil-- 



DcR3 32 E 
OPG 26 



E tIpItIyIp w RElAfi^rlG e RfilvfElA QlG^p^R.^c?jrjp V 0 R pfc] 
- -ISkuJl h ylnlEL^s H oiilLldD kIc^p^p^g^Iy l k o hIcJ 



DCR3 £3 R R D S p[5JtE]g|51P1p R h|^Q fIwIm Y L E R(G]R[xic]N V 
OPG SS T A K ff KgdvlaAt^fiJo H YUOUD sIhJh J S.D E\t}L\XlSlS P 



DCR3 94 h C 



OPG 86 . VlfilKtHL Q Y V K Q 



G S 



R E E E A 



R a[c]h AtT'«{ir r|a(c1r|H1r t(g]f f a h a g 

Q RiriM R lT-gHf Ri vlclElciK ElfijR Y L E I E 



is^g:W'Gi F rc?vl v o AifiMfilE RuiilvlclK RiciaD 



OcR3 156 
OPG 146 



N E TLS;:ajK a 



DCR3 187 S SlppjT LTcIT SCTGFPLSTRVPGA 
OPG 179 A -Hti-DlM iIg|s GHSBSTQKCG ID - V 



- ji 

q[c]q pfHlRlM^ClT A LjG^LlA(li|N V pfcls 
pIcJr KlHlT lt^ cI s V FbS^LUjT Q KlfijN 

E E|e:B]R|A]V I D 

T lIc^e|a|f P R 



DcR3 216 
OPG .209 



HRv A P Q 0 I S I K rU}q RfinL Q AfZlE APECWGPT-pIr] 

UJa V p t k f t p n wlils vjijv d nLlIp GTKVNAESV ElfiJ 

DcR3 248 A gIrJa ALQLKLRR RfOt E^L G h\Q\v g|a]l - lMr. L L 
: OPG 240- I Klalo HSSQEQTF QlijL kLuw K HlfljN kIaJq D ibUK K I 

M p G L e[r]s V r e r fClIp V H ; 

E N S V QliUH I G H A HUjf P B : 



0CR3 276 Q A L R V A R 
.OPG 271 IQ O 10 LC 



300 
; 293 



^ O 




F0tal 



it J J 



C«IIKne 



Figure 1 Primaiy structure and expression of human OcR3, a, Alignment of the 
amino-add sequences of DcR3 and of osteoprotegerin (OPG): the C-terminal 101 
residues of OPG are not showa The putative signal cleavage site (arrow), the 
cysteine-rich domains (CRD 1 -4), and the A/-linked glycosylation site (asterisk) are 
shown, b. Expression of DcR3 mRNA. Northern hybridization analysis was done 
using the OcR3 cONA as a probe and blots of poly(A)* RNA (Clontech) from 
human fetal and adult tissues or cancer cell lines. PBL. peripheral blood 
lymphocyte. 



induction of apoptosis to a similar extent. Thus, DcR3 binding 
blocks apoptosis induction by FasL. 

FasL-induced apoptosis is important in elimination of virus- 
infected cells and cancer cells by natural kiDer cells and cytotoxic T 
lymphocytes; an alternative mechanism involves perforin and 
granzymes Peripheral blood natural killer cells triggered 
marked cell death in Jurkat T leukaemia cells (Fig. 3d); DcR3-Fc 
and Fas-Fc each reduced killing of target celb from —65% to 
—30%. %vith half-maximal inhibition at — 1 pignU"*; the residual 
killing was probably mediated by the perforin/granzymc pathway 
Thus, DcR3 binding blocks FasL-dependent natural Idllcr cell 
activity. Higher DcR3-Fc and Fas-Fc concentrations were required 
to block natural kiUer cell activity compared with those required to 
block soluble FasL activity, which is consistent with the greater 
potency of membrane-associated FasL compared with soluble 
FasL''. 

Given the role of immune-cytotoxic cells in elimination of 
tumour cells and the fact that DcR3,can act as an inhibitor of 
FasL, we proposed that DcR3 expression might contribute to the 
ability of some tumours to escape immune-cytotoxic attack. As 
genomic amplification frequently contributes to tumorigenesis, we 
investigated whether the DcR3 gene is amplified in cancer. We 
analysed DcR3 gene-copy number by quantitative polymerase chain 
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Figure 2 Interaction of OcR3 with FasL a. 293 cells were transfected with pRKS 
vector (lop) or with pRK5 encoding full-length FasL (bottom), incubated with 
DcR3-Fc (solid line, shaded area), TNFRl -Fc (dotted line) or buffer control 
(dashed line) (the dashed and dotted lines overlap). and analysed for binding by. 
FAGS. Statistical analysis showed a significant difference (P < 0.001 ) between the 
binding of DcR3-Fc to cells transfected with FasL or pRK5. PE. phycoerythrin- 
labelled cells, b, 293 cells were transfected as in a and metabolicaily labelled, and 
cell supernatants were immunoprecipitated with Fc-tagged TNFRl, OcR3 or Fas. 
c. Purified soluble FasL (sFasL) was imrriunoprecipitated with TNFRt -Fc. OcR3- 
Fc or Fas-Fc and visualized by immunoblot with anti-FasL antibody. sFasL was 
loaded directly for comparison in the right-hand lane, d. Rag-tagged sFasL vyas 
incubated with OcR3-Fc or with buffer and resolved by get filtration; column 
fractions were analysed in an assay that detects complexes containing DcR3-Fc 
and sFasL-Flag. e. Equilibrium binding of DcR3-Fc or Fas-Fc to sFasL-Rag. 
Inset competition of DcR3-Fc with Fas-Fc for binding to sFasL-Flag. 
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reaction (PGR)" in genomic DNA from 35 primary lung and colon 
tumours, relative to pooled genomic DNA from peripheral blood 
leukocytes (PBLs) of 10 healthy donors. Eight of 18 lung tumours 
and 9 of 17 colon tumours showed DcR3 gene amplification, 
ranging from 2- to 18-fold (Fig. 4a, b). To confirm this result, we 
analysed the colon tumour DNAs with three more, independent sets 
of DcR3 -based PGR primers and probes; we observed nearly the 
same amplification (data not shown). 

.. We then analysed, DcR3 mRNA expression in primary tumour 
tissue sections by in situ hybridization. We detected DcR3 expres- 
sion in 6 out of 15 lung tumours, 2 out of 2. colon tumours, 2 out of 5 
breast tumours, and 1 out of I gastric tumour (data not shown). A 
section through a squamous-cell carcinoma of the lung is shown in 
Fig. 4c. DcR3 mRNA was localized to infiltrating malignant epithe- 
lium; but was essentially absent from adjacent stroma, indicating 
tirniour-specific expression. Although the individual tumour speci- 
mens that we analysed for mRNA expression and gene amplification 
were different, the in situ hybridization results are consistent with 
the finding that the DcR3 gene is amplified frequendy in tumours. 
SW480 colon carcinoma cells, which showed abundant DcR3 
mRNA egression (Fig. lb), also had marked DcR3 gene amplifica- 
tion, ias shown by quantitative PGR (fourfold) and by Southern blot 
hybridization (frvefold) (data not shown). 

If DcR3 amplification in cancer is fiinctionally relevant, then 
DcR3 should be amplified more than neighbouring genomic 
regions that are not important for tumour survival To test this, 




Time (h) Inhibitor ()ig mH) 



Rgure 3 Inhibition of FasL activity by OcR3. a. Human Jurkat T leukaemia cells 
were incubated with Flag-tagged soluble FasL (sFasU 5 ng m!"^) oligomerized 
with anti-Flag antibody (0.1 M-g^r') in the presence of the proposed inhibitors 
OcR3-Fc, Fas-Fc or human IgGi arid assayed for apoptosis (mean ± s.e.m. of 
triplicates), b, Jurkat cells were irK:ubated with sFasL-Flag.plus anti-Flag antibody 
as in a; in presence of 1 p-g mr' DcR3-Fc (filled circles), Fas-Fc (open circles) or 
human tgGl (triangles), and apoptosis was determined at the indicated time 
points, c. Peripherai bkx)d T celts were stimulated with PHA and interieukin-2. 
followed by control (white bars) or anti^03 antibody (filled bars), together with 
phosphate-buffered saline (PBS), human IgGi, Fas-Fc. or DcR3-Fc(lO»igmr'). 
After 16 h. apoptosis of CD4' cells was determined (mean ± s.e jn, of results from 
five donors), Peripheral blood natural killer cells were incubated with *'Cr- 
labelled Jurkat cells in the presence of OcR3-Fc (hUed circles). Fas-Fc (open 
circles) or human IgGi (triangles), and target-cell death was determined by 
release of ^^Cr (mean ± s.d. for two donors, each in triplicate).- 
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we mapped the human DcR3 gene by radiation- hybrid analysis; 
DcR3 showed linkage to marker AFM2 18xc7 (1160), which maps to 
. chromosome position 20ql3. Next, we isolated from a bacterial 
artificiar chromosome (BAG) library a human genomic clone that 
carries DcR3, and sequenced the ends of the clone's insert. We then 
determined, from the nine colon tumours that showed twofold or 
greater amplification of DcR3, the copy number of the DcR3- 
flanking sequences (reverse and forward) from the BAG, and of 
seven genomic markers that span chromosome 20 (Fig. 4d). The 
DcR3-linked reverse marker showed an average amplification of 
roughly threefold, slighdy less than the approximately fourfold 
amplification of DcR3; the other markers showed little or no 
amplification. These data indicate that DcR3 'may be at the *epi- 
centre* of a distal chromosome 20 region that is amplified in colon 
cancer, consistent with the possibility that DcR3 amplification 
promotes tumour survival. 

Our results show that DcR3 binds specifically to FasL and inhibits 
FasL activity. We did not detea DcR3 binding to several other TNF- 
ligand- family members; however, this does not rule out the possi- 
bility that DcR3 interacts with other ligands, as do some other 
TNFR family members, including OPG*"". . 

FasL is important in regulating the immune response; however, 
little is known about how FasL function is controlled. One mechan- 
ism involves the molecule cFLIP, which modulates apoptosis signal- 
ling downstream of Fa$^°. A second mechanism involves proteolytic 
shedding of FasL from the cell surface'^ DcR3 competes with Fas for 



a b d 




Figure 4 Genomic amplih'catbn of t^R3 in tumours, Lung cancers, cornprising 
eight adenocarcinomas (c. d. f. g. h, j, k. r). seven squamous-cell carcinomas (a. e. 
m, n. 0. p, q). one non-small-cell carcinoma (b). one small-cell carcinoma (i), and 
one bronchial adenocarcinoma (I). The data are means ± s.d. of 2 experiments 
done in duplicate, b. Colon tumours, comprising 17 adenocarcinomas. Data are 
means ± s.e.m. of five experiments done in duplicate, c, /n situ hybridization 
analysis of OcR3 mRNA expression in a squamous-cell carcinoma of the lung. A 
representative bright-field image (left) and the corresponding daric-field image 
(right) show OcR3 mRNA over infiltrating malignant epithelium (arrowheads). 
Adjacent non-malignant stroma (S). blood vessel (V) and necrotk: tumour tissue 
(N) are also shown, d. Average amplification of (DcR3 compared with ampiifica- 
tion of neighbouring genomic regions (reverse and forward. Rev and Fwd), the 
OcR3-linked marker T160. and other chromosome-20 markers, in the nine colon 
tumours showing OcR3 amplification of twofold or more (b). Data are from two 
experiments done in duplicate. Asterisk indicates P < 0.01 for a Student's Mest 
comparing each marker with OcR3. 
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FasL binding; hence, it may represent a third mechanism of 
extracellular regulation of FasL activity. A decoy receptor that 
modulates the function of the cytokine interleukin-l has been 
described^'. In addition, two decoy receptors that belong to the 
TNFR family, DcRl and DcR2, regulate the FasL-related apoptosis- 
inducing molecule Apo2L". Unlike DcRl and DcR2, which are 
membrane-associated proteins, DcR3 is directly secreted into the 
extracellular space. One other secreted TNFR-faxnily member is 
OPG*, which shares greater sequence homology with DcR3 (31%) 
than do DcRl (17%) or DcR2 (19%); OPG functions as a third 
decoy for Apo2L", Thus, DcR3 and OPG define a new subset of 
TNFR-family members, that function as secreted decoys to mod- 
ulate ligands that induce apoptosis. Pox viruses produce soluble 
TNFR homologues that neutralize specific TNF-family ligands, 
thereby modulating the antiviral immune response^ Our results 
indicate that a similar mechanism, namely, production of a soluble 
decoy receptor for FasL, may contribute to immune evasion by 
certain tumours. ■ Q 



Methods 

Isolation of DcR3 cONA. Several overbpping ESTs in GenBank (accession 
numbers AA025672, AA025673 and W67560) and in Ufescq™ (Incytc 
Pharmaceuticals; accession numbers 1339238. 1533571, 1533650, 1542861. 
1789372 and 2207027) showed simUarity to members of the TNFR family. We 
screened human cDNA libraries by PGR with primers based on the region of 
EST consensus; fetal lung was positive for a produa of the expected size. By 
hybridization to a PCR-generatcd probe based on the ESTs, one positive done 
(DNA50942) v/zs identified. When searching for potential alternatively spliced 
forms of DcR3 that might encode a transmembrane protein, we isolated 50 
more dones; the coding regions of these dones were identical in size to that of ' 
the initial done (data not shown). 

Fc-fusion proteins (immunoadhesins). The entire pcR3 sequence, or the 
ectodomain of Fas or TNFRt, was fused to the hinge and Fc region of human 
IgGi, expressed in insect SF9 cdb or in humaii-293 cells, and purified as . 
described". V 

Ruorescence-activated cell sorting (FACS) analysis. We transfected 293 
cells using caldum phosphate or Effectene (Qiagcn) with pRK5 vector or pRK5 
encodii^ full-length human FasL^ (2 m^), together with pRK5 encoding CrmA 
(2}ig) to prevent cell death. After 16 h. the cells were incubated with 
biotinylated DcR3-F,c or TNFRl-Fc and then with phycoerythrin^conjugated 
strcpuvidin (GibcoBRlj/and were assayed by FACS. The data were analysed by 
Kolmogorov-Smirnov' statistical analysis. There was some detectable staining 
of vector-transfcctcd cells by DcR3~Fc; as these cells express little FasL (data 
not shown)»: it is possible that DcR3 recognized some other factor that is 
expressed constitutivdy on 293 cells. 

Immunoprecipitation. Human 293 cells were transfected as above, and 
meubolically labelled with P^Sjcysteine and I "S] methionine (0.5 mCi; 
Amersham). After 16 h of culture in the presence of z-VAD-ftnk (lOjiM), 
the medium was immunoprecipiuted with DcR3-Fc, Fas-Fc or TNFRl-Fc 
(Sjig). followed by protein A-Sepharose (Repligen). The predpiutes were 
resolved by SDS-PAGE and visualized on a phosphorimager (Fuji BAS2000). 
Alternatively, purified. Flag-tagged soluble FasL (1 fAg) (Alexis) was incubated 
with each Fc-fusion protein (1 jig), precipitated with protein A-Sepharosc, 
resolved by SDS-PAGE and visualized by immunoblotting with rabbit anti- 
FasL antibody (Oncogene Research). 

Analysis of complex formation. Flag-tagged soluble FasL (25jjLg) was 
incubated with buffer or with DcR3-Fc (40 ji-g) for 1.5 h at 24 "C, The reaction 
was loaded onto a Superdex 200 HR 10/30 column (Pharmacia) and devdoped 
with PBS; 0.6-ml fractions were coUected. The presence of DcR3~Fc-FasL 
complex in each ftaction was analysed by pladng 100 ^,1 aliquots into microtitre 
wells precoated with anti-human IgG (Boehringer) to capture DcR3-Fc. 
followed by detection with biotinylated anti-Flag antibody Bio M2 (Kodak) and 
strcptavidin-horscradish peroxidase (Amersham). Calibration of the column 
indicated an apparent relative molecular maiss of the complex of 420K (data not 
shown), which is consistent with a stoidiiometry of two DcR5-Fc homodimcrs 
to two soluble FasL homotrimers: ■ 

Equttibrjum binding.analysjis. Microtitre wells were coated with anti-hftuxian 



IgG, bldcked with 2% BSA in PBS. DcR3-Fc or Fas-Fc was added, followed by 
serially diluted Flag-tagged soluble FasL Bound ligand was detected with anti- 
Flag antibody as above. In the competition assay, Fas-Fc was immobilized as 
above, and the wells were blocked with excess IgGl before addition of Flag- 
tagged soluble FasL plus DcR3-Fc. 

T-cell AlCO. CD3* lymphocytes were isolated from peripheral blood of 
individual donors using anti-CD3 magnetic beads (Miltcnyi Biotech), 
stimulated with phytohaemagglutinin (PHA; 2 ^g ml'') for 24 h, and cultured' 
in die presence of interieukin-2 ( 100 U mP' ) for 5 days. The cells were plated in 
wells coated with anti-CD3 antibody (Pharmingen) and analysed for 3^>optosis 
16 h later.by FACS analysis of annexin-V-binding of CD4* cells". 
Natural killer cell activity. Natural killer cells were isolated from peripheral 
blood of individual donors using anti-CD56 magnetic beads (Miltcnyi 
Biotedi), and incubated for 16 h with ''Cr-loaded Jurkat cells at an effcaor- 
to-urget ratio of 1:1 in the presence of PcR3-Fc, Fas-Fc or human IgGL 
Target -cell death was determined by release of **Gr in cffector-urgct co- 
cultures relative to. release of "Cr by detergent lysis of equal numbers of Jurkat 

Cdls. 'at- 

Gene-amplification analysis.. Surgical specimens were provided by j. Kern 
(lung tumours) and R Quirke (colon tumours). Genomic DNA was extracted 

• (Qiagen) and the conceotratioh was determined using Hoechst dye 33258 
intercalation fluororoetry. Amplification was determined by quantiutive PGR" 
using a TaqMan instrument (ABI). The method was validated by comparison of 
PGR and Southern hybridization data for the Myc and HER- 2 oncogenes (data 
not shown). Gene-specific primers and fluorogenic probes were dcsigiied on 
the basis of the sequence of DcR3 or of nearby regions identified on a BAG 
carrying the human .DcR3 gene; alternatively, primers and probes were based 

'■on Stanford Human Genome Center marker AFM218xe7 (T160), which is 
linked to DcR3 (likelihood score = 5.4), SHGC-36268 (T159). the nearest 

- available marker which maps to -^500- kilobases .from T160, and five extra 
markers that span chromosome 20. The DcR3-spedfic. primer sequences were 
5'-CTTCTTCGCGCACGCTG-3' and 5'-ATCACGCCGGCACCAG-3* and die 
fluorogem:c probe sequence was 5'-(FAM-ACACGATG.CGTGCTCCAAGCAG 
AAp-(TAMARA). where FAM is 5' -fluorescein phosphoramiditc. Relative 
gene-copy numbers were derived using die formula 2**^*, where ACT is the 
difference in amplification cydes required to detect DcR3 in peripheral blood 
lymphocyte DNA compared to test DNA. 
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ABC transporters (also known as traffic ATPases) form a large 
family of proteins responsible for the translocation of a variety 
of compounds across membranes of both prokaryotes and 
eukaryotes*. The recently completed Escherichia coU genome 
sequence revealed that the largest family of paralogous\E.' co/i 
proteins is composed of ABC transporters^. Many eukaryotic 
proteins of medical significance belong to this family, such as 
the cystic fibrosis transmembrane conductance'regulator (CFTR), 
the P-glycoprotein (or multidrug-resistance protein) and the 
heterodimeric transporter associated 'with antigen processing 
{Tapl~Tap2). Here we report the crystal structure at 13 A resolu- 
tion of HisP, the ATP-binding subunit of the histidine permease, 
which is an ABC transporter from' Salmonella typhitnurium. We 
correlate the details of this structure with the biochemical, genetic 
and biophysical properties of the wild-type and several mutant 
HlsP proteins. The structure provides a basis for understanding 
properties of ABC transporters and of defective CFTR proteins. 

ABC transporters contain four structural domains: two nucleo- 
tide-binding domains (NBDs), which are highly conserved 
throughout the family, and two transmembrane domains'. In 
prokaryotes these domains are often separate subunits which are 
assembled into a membrane-bound complex; in eukaryotes the 
domains are generally hised into a single polypeptide chain. The 
periplasmic histidine permease of S. typhimurium and £ co/i'**"* is a 
well-characterized ABC transporter that is a good model for this 
superfamily. It consists of a membrane-bound complex. HisQMP2, 
which comprises integral membrane subunits, HisQ and HisM, and 
two copies of HisP, the ATP-binding subunit. HisP, which has 
properties intermediate between those of integral and peripheral 
membrane proteins', is accessible from both sides of the membrane, 
presumably by its interaction with HisQ and HisM*. The two HisP 
subunits form a dimer, as shown by their cooperativity in ATP 
hydrolysis^ the requirement for both subunits to be present for 
activity*, and the formation of a HisP dimer upon chemical cross- 
linking. Soluble. HisP also forms a dimer^: HisP has been purified 
and characterized in an active soluble form' which can be recon* 
sdtuted into a fiilly active membrane:bound complex". 

The overall shape of the crystal structure of the HisP monomer is 
that of an *L' with two thick arms (arm I and arm II); the ATP- 
binding pocket is near the end of arm I (Fig. I). A six-stranded p- 
sheet (P3 and p8-p 1 2) spans both arms of the L, with a domain of a 
a- plus P-type structure (pi, p2, p4-p7, al and a2) on one side 
(within arm I) and a domain of mosdy a-helices (a3-a9) on the 
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Figure 1 Crystal staicture of HisP. a. View of the dimer along an axis 
perpendicular to its two-fold axis. The top and bottom of the dimer are suggested . 
to face towards the periplasmic and cytoplasmic sides; respectively (see text). 
The thickness of arm II is about 25 A; comparable to that of membrane. a-Helices . 
are shown iri orange and p-sheets in green, b, View along the two-fold axis of the ' 
HisP dimer. showing the relative displacement of the monomers not apparent in 
a. The strands at the dimer interface are labelled, c View of one monomer from 
the bottom. of arm I. as shown in a. towards arm II. showing the ATP-binding 
pocket a-c. The protein and the bound ATP are in 'ribtxjn* and 'ball-and-stick' 
representations, respectively. Key residues discussed in the text are indicated in 
c. These figures were prepared with MOUSCRIPT^. N. amino terminus: C. C 
terminus. 
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NOVEL APPROACH TO QUANTITATIVE POLYMERASE CHAIN REACTION USING 
REAL-TIME DETECTION: APPLICATION TO THE DETECTION OF GENE 
AMPLIFICATION IN BREAST CANCER 
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^Laboratoire de Genetique Moleculaire, Faculte des Sciences Pharmaceutiques et Biologiques de Paris, Paris, France 
^Laboratoire d'Oncogenetique, Centre Rene Huguenin, Si-Cloud, France 



Gene amplification is a common event in the progression of 
human cancers, and amplified oncogenes have been shown to 
have diagnostic, prognostic and therapeutic relevance. A 
kinetic quantitative polymerase>chain-reaction (PCR) method, 
based on fluorescent TaqMan methodology and a new instru- 
ment (ABI Prism 7700 Sequence Detection System) capable 
of measuring fluorescence in real-time, was used to quantify 
gene amplification in tumor ON A. Reactions are character- 
ized by the point during cycling when PCR amplification is still 
in the exponential phase, rather than the amount of PCR 
product accumulated after a fixed number of cycles. None of 
the reaction components is limited during the exponential 
phase, meaning that values are highly reproducible in reac- 
tions starting with the same copy number. This greatly 
improves the precision of DNA quantification. Moreover, 
real-time PCR does not require post-PCR sample handling, 
thereby preventing potential PCR-prbduct carry-over con- 
tamination; it possesses a wide dynamic range of quantifica- 
tion and results in much faster and higher sample throughput. 
The real-time PCR method, was used to develop and validate 
a simple and rapid assay for the detection and quantification 
of the 3 most frequently amplified genes (myc, ccndf and 
erbB2) in breast tumors. Extra copies of myc, ccndl and erbBZ 
were observed in 10, 23 and 15%, respectively, of 108 breast- 
tumor DNA; the largest observed numbers of gene' copies 
were 4.6, 18.6 and 15.1, respectively. These results correlated 
well with those of Southern blotting. The use of this new 
semi-automated technique will make molecular analysis of 
human cancers simpler and more reliable, and should find 
broad applications in clinical and research settings. Int i. 
Cancer 78:661-666, 1998. 
© 1998 miey-Liss, Inc. 

Gene amplification plays an important role in the pathogenesis 
of various solid tumors, including breast cancer, probably because 
over-expression of the amplified target genes confers a selective 
advantage. The first technique used to detect genomic amplification 
was cytogenetic analysis. Amplification of several chromosome 
regions, visualized either as extrachromosomal double minutes 
(dmins) or as integrated homogeneously staining regions (HSRs), 
are among the main visible cytogenetic abnormalities in breast 
tumors. Other techniques such as comparative genomic hybridiza- 
tion (CGH) (Kallioniemi et ai, 1 994) have also been used in broad 
searches for regions of increased DNA copy numbers in timior 
cells, and have revealed some 20 amplified chromosome regions in 
breast tumors. Positional cloning efforts are underway to identify 
the critical gene(s) in each amplified region. To date, genes known 
to be amplified frequently in breast cancers include myc (8q24), 
ccnd\ (llql3),and€ri>B2 (17ql2-q21)(forreview, seeBiecheand 
Lidereau, 1995). 

Amplification of the myc, ccndl, and erbB2 proto-oncogenes 
should have clinical relevance in breast cancer, since independent 
studies have shown that these alterations can be used to identify 
sub-populations with a worse prognosis (Hems et ai, 1992; 
Schuuring et ai, 1992; Slamon et ai, 1987). Muss et al (1994) 
suggested that these gene alterations may also be useful for the 
prediction and assessment of the efficacy of adjuvant chemotherapy 
and hormone therapy. . 

However, published results diverge both in terms of the fre- 
quency of these alterations and their clinical value. For instance, 
over 500 studies in 10 years have failed to resolve the controversy 



surrounding the link suggested by Slamon et ai (1987) between 
erbhl amplification and disease progression. These discrepancies 
are partly due to the clinical, histological and ethnic heterogeneity 
of breast cancer, but technical considerations are also probably 
involved. 

Specific genes (DNA) were initially quantified in tumor cells by 
means of blotting procedures such as Southern and slot blotting. 
These batch techniques require large amounts of DNA (5-10 
jig/reaction) to yield reliable quantitative results. Furthermore, 
meticulous care is required at all stages of the procedures to 
generate blots of sufficient quality for reliable dosage analysis. 
Recently, PCR has proven to be a powerfiil tool for quantitative 
DNA arialysis, especially with minimal starting quantities of tumor 
samples (small, early-stage tumors and formalin-fixed, paraffin- 
embedded tissues). 

Quantitative PCR can be performed by evaluating the amount of 
product either after a given number of cycles (end-point quantita- 
tive PCR) or after a varying number of cycles during the 
exponential phase (kinetic quantitative PCR). In the first case, an 
internal standard distinct from the target molecule is required to 
ascertain PCR efficiency. The method is relatively easy but implies 
generating, quantifying and storing an internal standard for each 
gene studied. Nevertheless, it is the most frequently applied 
method to date. 

One of the major advantages of the kinetic method is its rapidity 
in quantifying a new gene, since no internal standard is required (an 
external standard curve is sufiQcient). Moreover, the kinetic method 
has a wide dynamic range (at least 5 orders of magnitude), giving 
an accurate value for samples differing in their copy number. 
Unfortunately, the method is cumbersome and has therefore been 
rarely used. It involves aliquot sampling of each assay mix at 
regular intervals and quantifying, for each aliquot, the amplifica- 
tion product Interest in the kinetic method has been stimulated by a 
novel approach using fluorescent TaqMan methodology and a new 
instrument (ABI Prism 7700 Sequence Detection System) capable 
of measuring fluorescence in real time (Gibson et ai, 1996; Heid et 
ai, 1996). The TaqMan reaction is based on the 5' nuclease assay 
first described by Holland et ai (1991). The latter uses the 5' 
nuclease activity of Taq polymerase to cleave a specific fluorogenic 
oligonucleotide probe during the extension phase of PCR. The 
approach uses dual-labeled fluorogenic hybridization probes (Lee 
et ai. 1993). One fluorescent dye, co-valently linked to the 5' end 
of the oligonucleotide, serves as a reporter [FAM (i.e., j6-carboxy- 
fluorescein)] and its emission spectrum is quenched by a second 
fluorescent dye, TAMRA (i.e., 6-carboxy-tetramethyl-rhodamine) 
attached to the 3' end. During the extension phase of the PCR 
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cycle, the fluorescent hybridization probe is hydro! yzed by the 
5'-3' nucleolytic activity of DNA polymerase. Nuclease degrada- 
tion of the probe releases the quenching of FAM fluorescence 
emission, resulting in an increase in peak fluorescence emission. 
The fluorescence signal is normalized by dividing the emission 
intensity of the reporter dye (FAM) by the emission intensity of a 
reference dye {i.e.. ROX, 6-carboxy-X-rhodamine) included in 
TaqMan buffer, to obtain a ratio defined as the Rn (normalized 
reporter) for a given reaction tube. The use of a sequence detector 
enables the fluorescence spectra of all 96 wells of the thermal 
cycler to be measured continuously during PCR amplification. 

The real-time PCR method offers several advantages over other 
current quantitative PCR methods (Celi et al, 1994): (i) the 
probe-based homogeneous assay provides a real-time method for 
detecting only specific amplification products, since specific hybri- 
dation of both the primers and the probe is necessary to generate a 
signal; (ii) the Q (threshold cycle) value used for quantification is 
measured when PCR amplification is still in the log phase of PCR 
product accumulation. This is the main reason why C, is a more 
reliable measure of the starting copy number than are end-point 
measurements, in which a slight difference in a limiting component 
can have a drastic effect on the amount of product; (Hi) use of Q 
values gives a wider dynamic range (at least 5 orders of magni- 
tude), reducing the need for serial dilution; (iv) The real-time PCR 
method is run in a closed-tube system and requires no posi-PCR 
sample handling, thus avoiding potential contamination; (v) the 
system is highly automated, since the instrument continuously 
measures fluorescence in all 96 wells of the thermal cycler during 
PCR amplification and the corresponding software processes, and 
analyzes the fluorescence data; (vi) the assay is rapid, as results are 
available Just one minute after thermal cycling is complete; (vii) the 
sample throughput of the method is high, since 96 reactions can be 
analyzed in 2 hr, . 

Here, we applied this semi-automated procedure to determine 
the copy numbers of the 3 most frequently amplified genes in breast 
tumors {myc, ccndl and er6B2), as well as 2 genes (alb and app) 
located in a chromosome region in which no genetic changes have 
been observed in breast tumors. The results for 108 breast tumors 
were compared with previous Southem-blot data for the same 
samples. 



MATERIAL AND METHODS 
Tumor and blood samples 

'■ Samples were obtained from 1 08 primary breast tumors removed 
surgically from patients at the Centre Rene Huguenin; none of the 
patients had undergone radiotherapy or chemotherapy. Immedi- 
ately after surgery, the tumor samples were placed in liquid 
nitrogen until extraction of high-molecular- weight DNA. Patients 
were included in this study if the tumor sample used for DNA 
preparation contained more than 60% of tumor cells (histological 
analysis). A blood sample was also taken from 18 of the same 
patients. 

DNA was extracted from tumor tissue and blood leukocytes 
according to standard methods. 

Real-time PCR 

Theoretical basis. Reactions are characterized by. the point 
during cycling when amplification of the PCR product is first 
detected, rather than by the amount of PCR product accumulated 
after a fixed number of cycles. The higher the starting copy number 
of the genomic DNA target, the earlier a significant increase in 
fluorescence is observed. The parameter Ct (threshold cycle) is 
defined as the fractional cycle number, at which the fluorescence 
. generated by cleavage of the probe passes a fixed threshold above 
. baseline. The target gene copy number in unknown samples is 
quantified, by measuring Q and by using a standard cuive to 
determine the starting copy number. The precise amount of 
genomic DNA (based on optical density) and its quality (i.e., lack 



of extensive degradation) are both difficult lo assess. We therefore 
also quantified a control gene (albj mapping to chromosome region 
4qH-ql3. in which no genetic alterations have been found in 
breast-mmor DNA by means of CGH (Kallioniemi et al, 1994). 

Thus, the ratio of the copy number of the target gene to the copy 
number of the alb gene normalizes the amount and quality of 
genomic DNA. The ratio defining the level of amplification is 
termed "N'\ and is determined as follows: 

copy number of target gene {app. myc. ccndl. erb^l) 

^ — '■ ■' ■■— — "~ . 

copy number of reference gene {alb) 

Primers, probes, reference human genomic DNA and PCR 
consumables. Primers and probes were chosen with the assistance 
of the computer programs Oligo 4.0 (National Biosciences, Ply- 
mouth, MN), EuGene (Danibcn Systems, Cincinnati, OH) and Primer 
Express (Perkin-Elmer Applied Biosystems, Foster City, CA), 

Primers were purchased from DNAgency (Malvern, PA) and 
probes from Perkin-Elmcr Applied Biosystems. 

Nucleotide sequences for the oligonucleotide hybridization 
probes and primers are available on request 

The TaqMan PCR Core reagent kit. MicroAmp optical tubes, 
and MicroAmp caps were from Perkin-Elmer Applied Biosystems. 

Standard-curve construction. The kinetic method requires a 
standard curve. The latter was constructed with serial dilutions of 
specific PCR products, according to Piatak et al. (1993). In 
practice, each specific PCR product was obtained by amplifying 20 
hg of a standard human genomic DNA (Boehringer, Mannheim, 
Germany) with the same primer pairs as those used later for 
real-time quantitative PCR. The 5 PCR products were purified 
usiiig MicroSpin S-400 HR columns (Pharmacia, Uppsala, Swe- . 
den) electrophorezed through an acrylamide gel and stained with 
ethidium bromide to check their quality. The PCR products were 
then quantified spectrophotometrically and pooled, and serially 
diluted 10-fold in mouse genomic DNA (Clontech, Palo Alto, CA) 
at a constant concentration of 2 ng/>iL The standard curve used for 
real-time quantitative PCR was based on serial dilutions of the pool 
of PCR products ranging firom 10"' (10^ copies of each gene) to 
10" '° (10^ copies). This series of diluted PCR products was 
aliquotcd and stored at -SO'^C until use. 

The standard curve was validated by analyzing 2 known 
quantities of calibrator human genomic DNA (20 ng and 50 hg). 

PCR amplification. Amplification mixes (SO fil) contained the 
sample DNA (around 20 ng, around 6600 copies of disomic genes), 
lOX TaqMan buffer (5 fil), 200 pM dATP, dCTP, dGTP, and 400 
)iM dUTP, 5 mM MgClj. 1,25 units of AmpliTaq Gold, 0.5 units of 
AmpErase uracil N-glycosylase (UNG), 200 nM each primer and 
100 nM probe. The thermaf cycling conditions comprised 2 min at 
50X and 1 0 min at 95X. Thermal cycling consisted of 40 cycles at 
95°C for 15 s and 65'C for I min. Each assay included: a standard 
curve (from 10-^ to 10^ copies) in duplicate, a no-template control, 
20 ng and 50 ng of calibrator human genonriic DNA (Boehringer) in 
triplicate, and about 20 ng of unknown genomic DNA in triplicate 
(26 samples can thus, be analyzed on a 96-wcll microplate). All 
samples with a coefficient of variation (CV) higher dian 10% were 
retested. 

All reactions were performed in the ABI Prism 7700 Sequence 
Detection System (Perkin-Elmer Applied Biosystems), which 
detects the signal from the fluorogenic probe during PCR. 

Equipment for real-trme detection. The 7700 system has a 
built-in thermal cycler and a laser directed via fiber optical cables 
to each of the 96 sample wells, A charge-coupled-devicc (CDD) 
camera collects the emission from each sample and the data are 
analyzed automatically. The software accompanying the 7700 
system calculates C, and detennines the starting copy number in the 
samples. 
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Determination of gene amplification. Gene amplification was 
calcuiated as described above. Only samples with an N value 
higher than 2 were considered to be amplified. 

RESULTS 

To validate the method, real-time PCR was performed on 
genomic DNA extracted from 108 primary breast tumors, and 18 
normal leukocyte DNA samples from some of the same patients. 
The target genes were the myc, ccndi and erbBl proto-oncogenes, 
and the p-amyloid precursor protein gene (app), which maps to a 
chromosome region (2 lq2l.2) in which no genetic alterations have 
been found in breast tumors (Kallioniemi et ai, 1994), The 
reference dtsomic gene was the albumin gene {alb, chromosome 
4qll-ql3). 



Validation of the standard curve and dynamic range 
of real-time PCR 

The standard curve was conslaicted from PCR products serially 
diluted in genomic mouse DNA at a constant concentration of 
2 ng/^l. It should be noted that the 5 primer pairs chosen to analyze 
the 5 target genes do not amplify genomic mouse DNA (data not 
shown). Figure I shows the real-time PCR standard curve for the 
alb gene. The dynamic range was wide (at least 4 orders of 
magnitude), with samples containing as few as 10^ copies or as 
many as 10^ copies. 

Copy-number ratio of the 2 reference genes (app and 2Ah) 

The app to alb copy-number ratio was determined in 1 8 normal 
leukocyte DNA samples and all 108 primary breast-tumor DNA 
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Figure I - Albumin {alb) gene dosage by real-time PCR. Top: Amplification plots for reactions with starting alb gene copy number ranging 
from 10^ (A9), 10^ (A7), 10^ (A4) to 10^ (A2) and a no-template control (Al). Cycle number is plotted vs. change in normalized reporter signal 
(ARn). For each reaction tube, the fluorescence signal of the reporter dye (FAM) is divided by the fluorescence signal of the passive reference dye 
(ROX), to obtain a ratio defined as the normalized reporter signal (Rn). ARn represents the normalized reporter signal (Rn) minus the baseline 
signal established in the first 15 PCR cycles. ARn increases during PCR as alb PCR product copy number increases until the reaction reaches a 
plateau. C, (threshold cycle) represents the fractional cycle number at which a significant increase in Rn above a baseline signal (horizontal black 
line) can first be detected. Two replicate plots were performed for each standard sample, but the data for only one are shown here. Bottom: 
Standard curve plotting log starting copy number vs, C, (direshold cycle). The black dots represent the data for standard samples plotted in 
duplicate and the red dots the data for unknown genomic DNA samples plotted in triplicate. The standard curve shows 4 orders of linear dynamic 
range. 
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samples. We selected these 2 genes because they are located in 2 
chromosome regions (a/?/?. 21q2l.2; alb. 4qll-ql3) in which no 
obvious genetic changes (including gains or losses) have been 
observed in breast cancers (Kallioniemi et al., 1994). The ratio for 
the 18 normal leukocyte DNA samples fell between 0.7 and 1.3 
(mean 1.02 i 0.21), and was similar for the 108 primary breast- 
tumor DNA samples (0.6 to 1.6, mean 1.06 ± 0.25), confirming 
that alb and app are appropriate reference disomic genes for 
breast-tumor DNA, The low range of the ratios also confirmed that 
the nucleotide sequences chosen for the primers and probes were 
not polymorphic, as mismatches of their primers or probes with the 
subject*s DNA would have resulted in difTerential amplification. 

myc, ccndl and ctbB2 gene dose in normal leukocyte DNA 

To determine the cut-off point for gene amplification in breast-, 
cancer tissue, 18 normal leukocyte DNA samples were tested for 
the gene dose (N), calculated as described in "Material and 
Methods". The N value of these samples ranged from 0,5 to 1.3 
(mean 0.84 ± 0,22) for myc, 0,7 to 1.6 (mean 1.06 ± 0.23) for 
ccnd] and 0.6 to 1.3 (mean'0.91 ±6.19) foverbBl, Since N values 
for myc, ccndl and erbB>2 in normal leukocyte DNA consistently 
fell between 0.5 and 1.6, values of 2 or more were considered to 
represent gene amplification in tumor DNA. . . 

myc, ccndl and cxbB2 gene dose in breast-tumor DNA 

myc. ccndl and erbB2 gene copy numbers in the 108 primary 
breast tumors are reported in Table I. Extra copies of ccndJ were 
more frequent (23%, 25/108) than extra copies of erbBl (15%. 
16/108) and myc (10%, 11/108), and ranged from 2 to 18.6 for 
ccnd J. 2 to 15.1 for erbB2, and only 2 to 4,6 for the myc gene. 
Figure 2 and Table 11 represent tumors in which the ccnd] gene was 
amplified 16-fold (T145), 6-fold (T133) and non-amplified (TH8), 
The 3 genes were never found to be co-amplified in the same tumor. 
erbBl and ccndJ were co-amplified in only 3 cases, myc and ccndl 
in 2 cases and myc znd erbBl in I case. This favors the hypothesis 
that gene amplifications are independent events in breast cancer. 
Interestingly, 5 tumors showed a decrease of at least 50% in the 
erbBl copy number (N < 0.5), suggesting that they bore deletions 
of the I7q21 region (the site of erbBl). No such decrease. in copy 
number was observed with the other 2 proto-oncogencs. 

^Comparison of gene dose determined by real-time quantitative 
PCR and Southern-blot analysis 

Southern-blot analysis of myc, ccndl and erbBl amplifications 
had previously been done on the same 108 primary breast tumors. A 
perfect correlation between the results of real-time PCR and 
Southern blot was obtained for tumors with high copy numbers 
(N S 5). However, there were cases (I myc, 6 ccndl and 4 erbBl) 
in which real-time PCR showed gene amplification whereas 
Southern-blot did not, but these were mainly cases with low extra 
copy numbers (N fi'om 2 to 2.9). 

DISCUSSION 

The clinical applications of gene amplification assays are 
cunently limited, but would certainly increase if a simple, standard- 
ized and rapid method were perfected. Gene amplification status 
has been studied mainly by means of Southern blotting, but this 
method is not sensitive enough to detect low-level gene amplifica- 
tion nor accurate enough to quantify the full range of amplification 
values. Southern blotting is also time-consuming, uses radioactive 



TABLE I - DISTRIBUTION OF AMPLIFICATION LEVEL (N) FOR mye. 
ccndl AND erb^l GENES IN 108 HliMAN BREAST TUMORS 



Gene 




Ampiification level (N) 






0.5-1-9 2-4.9 




myc 


0 


97(89.8%) 11 (10.2%) 


0 


ccndl 


0 


83 (76.9%) 17(15.7%) 


8 (7.4%) 


erbBl 


5 (4.6%) 


87 (80.6%) 8 (7.4%) 


8 (7.4%) 



reagents and requires relatively large amounts of high-quality 
genomic DNA, which means it cannot be used routinely in many 
laboratories. An amplification step is therefore required to deter- 
mine the copy number of a given target gene fi'om minimal 
quantities of tumor DNA (small early-stage tumors, cytopuncture 
specimens or formalin-fixed, paraffin-embedded tissues). 

In this study, we validated a PCR method developed for the 
quantification of gene over-representation in tumors. The method, 
based on real-time analysis of PCR amplification, has several 
advantages over other PCR-based quantitative assays such as 
competitive quantitative PCR (Celi et aL 1 994). First, the real-time 
PCR method is performed in a closed-tube system, avoiding the 
risk of contamination by amplified products. Re-amplification of 
carryover PCR products in subsequent experiments can also be 
prevented b>y using the enzyme uracil N-glycosylase (UNO) 
(Longo et aL 1990). The second advantage is. the simplicity and 
rapidity of sample analysis, since no post-PCR manipulations are. 
required. Our results show that the automated method is reliable. 
We found it possible to determine, in triplicate, the riumber of 
copies of a target gene in more than 100 tumors per day. Third, the 
system has a linear dynamic range of at least 4 orders of magnitude, 
meaning that samples do not have to contain equal starting amounts 
of DNA. This technique should therefore be suitable for analyzing 
formalin-fixed, paraffin-embedded tissues. Fourth, and above all, 
real-time PCR makes DNA quantification much more precise and 
reproducible, since it is based on C, values rather than end-point 
measurement of the amount of accumulated PCR product. Indeed, 
the ABI Prism 7700 Sequence Detection System enables Q to be 
calculated when PCR amplification is still in the exponential phase 
and when none of the reaction components is rate-limiting. The 
within-run CV of the Ct value for calibrator human DNA (5 
replicates) was always below 5%, and the between-assay precision 
in 5 different runs was always below 10% (data not shown). Iii" 
addition, the use of a standard curve is not absolutely necessary, 
since the copy number can be determined simply by comparing the 
C, ratio of the target gene with that of reference genes. The results 
obtained by the 2 methods (with and without a standard curve) are 
similar in our experiments (data not shown). Moreover, unlike 
competitive quantitative PCR, real-time PCR does not require an 
internal control (the design and storage of internal controls and the 
validation of their amplification efficiency is laborious). 

The only potential disa vantage of real-time PCR, like all other 
PCR-based methods and solid-matrix blotting techniques (South- 
eni blots and dot blots) is that is cannot avoid dilution artifacts 
inherent in the extraction of DNA from tumor cells contained iii 
heterogeneous tissue specimens. Only FISH and immunohistochem- 
istry can measure alterations on a cell-by-cell basis (Pauletti et ai, 
1996; Slamon et aL, 1989). However, FISH requires expensive 
equipment and trained personnel and is also time*consuming. 
Moreover, FISH does not assess gene expression and therefore 
cannot detect cases in which the gene product is over-expressed in = 
the absence of gene amplification, which will be possible in the 
future by real-time quantitative RT-PCR. Immunohistochemistry is 
subject to considerable variations in the hands of different teams, 
owing to alterations of target proteins during the procedure, the 
different primary antibodies and fixation methods used and the 
criteria used to define positive staining. 

The results of this study are in agreement with those reported in 
the literature, (r) Chromosome regions 4qll-ql3 and 21q21.2- 
(which bear alb and app, respectively) showed no genetic alter- 
ations in the breast-cancer samples studied here, in keeping with 
the results of CGH (Kallioniemi et aL, 1994). (/i) We found Aat 
amplifications of these 3 oncogenes were independent events, as 
reported by other teams (Bcms etal., 1992; Borg et al., 1992). (iii) 
The frequency and degree of myc amplification in our breast tumor 

.'DNA series were lower than those of ccndl and erbBl amplifica- 
tion, confirming the findings of Borg et al. (1 992) and Courjal ei al^ 
(1997). (iV) The maxima of ccndl and erbBl over-representation 

. were 18-fold and 1 5-fold, also in keeping with earlier results (about 
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Figure 2 - ccndl and alb gene dosage by real-time PCR in 3 breast tumor samples: TU 8 (E 1 2, C6» black squares), T 1 33 (G 1 1 , B4, red squares) 
andTHS (A8, C8, blue squares). Given the Q of each sample, the initial copy number is inferred from the standard curve obtained during the same 
experiment Triplicate plots were performed for each tumor sample, but the data for only one are shown here. The results are shown in Table U. 



30-fold maximum) (Bems et at., 1992; Borg et al, 1992; Courjal et 
a/., 1997). (v) The er^B2 copy numbers obtained with real-time 
PCR were in good agreement with data obtained with other 
quantitative PCR-based assays in terms of the frequency and 
degree of amplification (An et al,, 1 995; Deng et at,, 1996; Valeron 



et at., 1996). Our results also correlate well with those recently 
published by Gelmini et ai ( \ 997), who used the TaqMan system to 
measure er6B2 amplification in a small series of breast tumors 
(n = 25), but with an instrument (LS-50B luminescence spectrom- 
eter, Perkin-Elmer Applied Biosystenis) which only allows end- 



666 BIECHE ETAL. 



TABLE II - EXAMPLES OF ccndJ GENE DOSAGE RESULTS 
. . FROM 3 BREAST TUMORS' 



Tumor 




ccndl 






alb 




HccndUalb 


Copy 
number 


Mean 


SD 


Copy 
number 


Mean 


SD 


. T118 


4525 






4223 










4605 


4603 


77 


4365 


4325 


89 


1.06 




4678 






4387 








T133 


59821 






9787 










61659 


61100 


1111 


10092 


10137 


375 


6.03 




61821 






10533 








T145 


128563 






7321 










125892 


125392 


3448 


7762 


, 7672 


316 


16.34 




121722 






7933 









'For each sample, 3 replicate cjqjerimenis were performed and the mean 
and the standard deviation (SD) was determined The level of ccndl gene 
amplification (t^cmH/alb) is determined by dividing die average <xndj 
copy nomber >^ue by the average o/b copy number value. 



point measurement of fluorescence intensity. Here we report myc 
' and ccndl gene dosage in breast cancer by means of quantitative 
PCR. (vi) We found a high degree of concordance between 
real-time quantitative PCR and Southern blot analysis in terms of 
gene amplification, especially for samples with high copy numbers 
(>:5-fold). The slightly higher frequency of gene amplification 
(especially ccndl znd erbBl) observed by means of real-time 
quantitative PCR as compared with Southem-blot analysis may be 
explained by the higher sensitivity of the former method. However, 
we cannot rule out the possibility that some tumors with a few extra 



gene copies observed in real-time PCR had additional copies of an 
arm or a whole chromosome (trisomy, letrasomy or polysomy) 
rather than true gene amplification. These 2 types of genetic 
alteration (polysomy and gene amplification) could be easily 
distinguished in the fiiture by using an additional probe located on 
the same chromosome arm, but some distance from the target gene. 
It is noteworthy that high gene copy numbers have the greatest 
prognostic significance in breast carcinoma (Borg et ai, 1992; 
Slamone/a/.. 1987). 

Finally, this technique can be applied to the detection of gene 
deletion as well as gene amplification. Indeed, we found a 
decreased copy number of erbBl (but not of the other 2 proto- 
oncogenes) in several tumors; erbB2 is located in a chromosome 
region (17q21) reported to contain both deletions and amplifica- 
tions in breast cancer (Bteche and Lidereau, 1995). . 

In conclusion, gene amplification in various cancers can be used 
as a marker of pre-neoplasia, also for eariy diagnosis of cancer, 
staging, prognostication and choice of treatment Southern blotting 
is not sufficiently sensitive, and FISH is lengthy and con^lex. 
Real-time quantitative PCR overcomes both these limitations, and 
is a sensitive and accurate method of analyzing large numbers of 
samples in a short time. It should find a place in routine clinical 
gene dosage. 
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Genetic Instability in Epitlielial 
Tiissues at Risk for Cancer 

WALTER N. HTITELMAN 
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Abstra^ct: Epithelial tuinors develop through a multistep process driven by 
genomic instability frequently associated with etiologic agents such as pro- 
longed tobacco smoke exposure or human papilloma vims (HPV) infection. 
The purpose of the studies reported here was to examinie the nature of genomic 
instability in epithelial tissues at cancer risk in order to identify tissue genetic 
biomaifkers that might be used to assess an individual's cancer risk and 
response to chemopreventive intervention. As part of several chemoprevention 
trials, biopsies were obtained from risk tissues (i.e., bronchial biopsies from 
chronic smokers, oral or laryngeid biopsies from individuals with premalig- 
nancy) and examined for chromosome instability using in situ hybridization. 
NeaJ*ly all biopsy specimens show evidence for chromosome Instability 
tnrougnout the exposea tissue, increased chromosome mstability was observed 
with histologic progression in the normal to tumor transition of head and neck 
squamous cell cardnomas. Chromosome mstability was also seen in premalig- 
nant head and neck lesions^ and high levels were associated with subsequent 
tumor development In bronchial biopsies of current smokers, the level of 
ongoing chromosome instability correlated with smoking Intensity (e.g., 
packs/day), whereas the chromosome index (average number of chromosome 
copies per cell) correlated with cumulative tobacco exposure (i.e., padc-years). 
Spatial chromosome analyses of the epithelium demonstrated multifocal clonal 
outgrowths. In former smokers, random chromosome instability was reduced; 
however, clonal populations appeared to persist for many years, perhaps 
accounting for continued lung cancer risk following smoking cessation. 

Keywords: chromosome histabHity; epithelial cdis; aerodigestiTe tract; 
chemoprevention; cancel risk 



THE NEED FOR BIOMARKERS OF CANCER RISK AND 
RESPONSE TO INTERVENTION 

Epithelial cancers remain a major health challenge in the world. Despite improve- 
meats in staging and the application and integration of suigery, radiotherapy, and 
chemodierapy, the 5-year survival rate for individuals with lung cancer is only about 
15%.^ Even if strategies for early detection are successful and lung cancers 
are detected at a stage where local tumor resection and treatment is curative, 
these patients will still be at significant risk for developing second primary tumors 
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ailssodated with the problem of field cancerization.^ Similarly, for individuals with a 
first head and neck primary tumor, even if the first malignancy is successfuUy treat- 
ed, the risk of developing a second primary in the tobacco smoke-exposed field is 
approximately 40%.^ Similar cancer risk estimates exist for individuals who exhibit 
severe dysplasia in premalignant epithelial lesions.^ For these reasons, it is important 
to focus on chemopreVentive strategies to prevent the development of epithelial 
malignancies. 

Several problems confront chemoprevention trials designed to identify efQca- 
cious agents.^ First, chemoprevention trials with cancer incidence as a primary end- 
point require tens of thousands of subjects and tens of years of intervention and 
follow-up for statistical evaluation. For example, a recently reported trial involved 
30,000 subjects and required 10 years in order to examine the impact of prevention 
strategies on lung cancer development, only to find a possible increased lung cancer 
incidence in current smokers who received p-carotene.^ 

The problem of large, long^tenn trials results from the difficulty in identifying 
individuals at highest canc» risk who might best benefit from chemopreventive 
int^ention. For sample, 20 pack-year smokers^ while known to be at relatively 
increased risk for developing lung cancer, have approximately a 10% lifetime risk 
for developing lung cancer. 'This seriously limits die number of potentially useful 

-.strategies that r a n hp rlinically.£xp lnrp/1 A Rf^rnnd prnhl ftm faring rhftmnprftvpntion 

trials is that little is known about what agents are likely to have efficacy, and even 
less is known regarding proper dos^s, schedules, and durations of treatment Part of 
the reason for this problem is that too litde is known about the physiologic processes 
that drive epithelial cancer development 

in order to reduce the number of subjects and the time required to carry out 
chemoprevention trials and thus allow the exploration of multiple prevention strate- 
gies, two types of advances are necessary. First, it is important to identify individuals 
at significantly mcreased cancer risk who might best benefit firom different types of 
intervention. Second, in order to allow the rapid identification of agents, doses, and 
schedules of potentially efficacious agents, it is necessary to identify and validate 
surrogate endpoints of response that indicate whether the agents are having a posi- 
tive impact on the target tissue during the chemopreventive intervention. 

One approach to identifying individuals at increased aerodigestive tract cancer 
risk is to explore epidemiologic features of potential subjects. Molecular epidemio- 
logic studies are beginning to identify intrinsic host factors that place some individ- 
uals at increased cancer risk, especially those with a chronic smoking history.^ Most 
intrinsic factors identified thus far reflect levels of carcinogen metabolism, repair 
capabilities of the host following DNA damage, and other measures of intrinsic 
cellular sensitivity to muUgens. While these factors can provide statistically signif- 
icant risk ratios in case-control studies that are controlled for tobacco exposure, the 
detected risk ratios usually fall in the range of 1.5 to 10. Unfortunately, this is not 
sufficient for the individualization of treatment and is not sufficiently high to signif- 
icantly reduce the numbers of subjects -required for chemoprevention trials with 
cancer incidence as the primary endpoint. 

Another approach to identifying individuals at increased cancer risk is to directly 
examine the target tissue of individuals with known carcinogen exposure (e.g., 
chronic tobacco smoke exposure), who have evidence of target organ dysfunction 




(e.g., chronic obstructive pulmonary disease, changes in voice quality), or who 
have clinical evidence of premalignancy (e.g., bronchial metaplasia/dysplasia, oral 
leukoplakia/erythroplakia, cervical intraepithelial neoplasia). The conventional 
standard for assessing cancer risk in these situations is the degree of histological 
change. However, while individuals who show moderate to severe dysplasia are 
known to be at increased cancer risk when compared to individuals wilfa Icsset his- 
tologic changes, it is often difficult to distinguish reacdve changes to carcinogenic 
insult from initiated and progressing lesions. Similarly, upon cessation of carcino- 
genic insult, histologic changes may reverse yet cancer risk may continue for many 
years. For example, while smoking cessation is associated with decreased bronchial 
metaplasia;^ increased lung cancer risk continues for many years beyond smoking 
cessation.^^ In fact, nearlv half the newly diagnosed lung cancer cases in the USA 
occur in former smokers. 

The development of assays to identify individuals at high epithelial cancer risk 
and to directly assess response to intervention in the target tissue is therefore an 
important research goal. Such assays should be objective and easily quantifiable and, 
if possible, minimally invasive. Moreover, they should reflect botii the disease pro- 
cess and the targeted pathway and thereby be useful in assessing risk and monitoring 
response to intervention as well as directly testing the hypothesized mechanism of 
ai ction of the chemo pr eventive s te ate^ — 

In the chemoprevention setting it is important to recognize that one does not 
know the location of the future cancer. Thus, assays must necessarily be carried out 
on random biopsies of the field at risk. Even if there are clinically evident premalig- 
nant lesions, this does not mean that this is the likely site for a future malignancy. 
For example, nearly half of the cancers that develop in individuals with oral leuko- 
plakia arise away from the original index lesion. Similarly, since many newly diag- 
nosed lung cancers arise in the peripheral parts of the lung (e.g., adenocarcinomas), 
especially iii former smokers, and since endobronchoscopy predominantly accesses 
central components of the lung, it is important to identify biomarfcers that can reflect 
global processes ongoing in the target epithelial field associated with increased can- 
cer risk. Hieir discovery requires a better understanding of the tumorigenesis pro- 
cess in epithelial fields at cancer risk. 



THE RATIONALE FOR STUDYING 
GENOMIC INSTABILITY AS A MARKER OF RISK 

TXimors of the aerodigestive Uract have been proposed to reflect a "field canceriza- 
tion" process whereby the whole tissue is exposed to carcinogenic insult (e.g., tob- 
acco smoke) and is at increased risk for multistep tumor development.*^^^ Several 
types of clinical and laboratory data support this notion, including the frequent 
occurrence of synchronous primary and subsequent second primary tumors in the 
aerodigestive tract (frequently exhibiting dissimilar histologies as well as distinct 
genetic signatures ^'*~^^) and the presence of premalignant lesions that precede and/or 
accompany the tumor in the exposed tissue field. The notion of a multistep tumor- 
igenesis process is further supported by serial clinical and histologic evaluations of 
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taiget tissue or exfoliated cells where increasing degrees of histological abnormali- 
ties are observed over time.** 

A working model for aerodigestive tract tomorigenesis is illustrated in Figurb 1. 
'Rimorigenesis in the face of carcinogenic exposure likely involves a chronic process 
of tiissue injury and wound healing. DMA damage induced by the carcinogen is likely 
fixed into p^manent genetic changes (e.g., chromosome damage, chromosome non- 
disjunction, gene mutation, gene deletion, etc.) durinjg the process of proliferation. 
This damage would be expected to be distributed throughout the exposed tissue field 
leading to a background of generalized genomic damage (depicted in Rourb 1 as a 
background mat of increasing density). Chronic injury and repair likely leads to the 
accumulation of ceUs with increasing amounts of genetic changes as well as the out- 
growth of abnormal clones (triangles in FIGURE 1) carrying an accumulation of 
genetic changes important for selective survival, dysregulated growth, and preferen- 
tial epithelial take-over by initiated clones (see Figure 2). 

Cellular and molecular evidence for the field carcinogenesis and multistep tum- 
origenesis model comes from many laboratories.*^*^ With the advent of a wide array 
of molecular technologies, a large number of specific molecular genetic and epige- 
* netic changes involving specific oncogenes, tumor suppressor genes, cell regulatory 
genes, and repair genes have now be^ described for aerodigestive tract cancers. The 
identification of these specific molecular c hanges have now provided probes to 
explore specific events occurring in premalignant lesions adjacent to aerodigestive 
tract tumors.^^"^ Frequently, these premalignant lesions showed a subset of the 
same molecular changes found in the associated tumor, suggesting that these lesions 
might represent precursor lesions for the associated tumors (i.e., a manifestation of 
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FIGURE 1. Field cancerization and multistep tumoiigenesis. 
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FIGURE 2. Multiple focal clonal evolution during multistep tumorigenesis. 



a multistep tumorigenesis process). For example, studies of the premalignant lesions 
adjacent to head ^d neck tumors have provided evidence for a gradual accumulation 
of genetic alterations accon^anied by evidence for dysregulation of cellular control 
mechanisms (e,g., alterations in expression of PCNA, EGFR, TGF-P, p53, and 

cycUnDl).25-^8 

These types of studies have now also been applied to the target epithelium of indi- 
viduals at increased risk for aerodigestive tract cancer (i.e., individuals with a chron- 
ic smoking/alcohol history and/or prior aerodigestive tract cancer). Several groups 
(using polymerase chain reaction, PCR» analysis of microdissected epithelium) have 
now demonstrated the presence of clonal outgrowths in the target premalignant epi- 
thelium of individuals at increased risk for cancer.^^^^ For example, examination of 
bronchial biopsies derived from individuals with a 20 pack-year smoking history 
demonstrated that 76% of the cases showed evidence for LOH (3pl4, 9p21, or 
17pl 3) in at least one of six lung biopsy sites. On a per site basis, some form of LOH 
was observed in 25% of the sites examined.?' 

If aerodigestive tract cancer development reflects a field cancerization process 
involving multistep events, then risk and response inforination should be able to be 
derived firom random biopsies or exfoliated cells from the field at risk or from assess- 
ments of tissue undergoing similar processes. Hypothetically, lesions exhibiting the 
greatest degree of genomic instability, clonal outgrowth, and abnormal epithelial 
regulation would be at the highest relative aerodigestive tract canc^ risk. Similarly, 
an active chemopreventive intervention might be expected to decrease these mani- 
festations of risk. Reduced risk manifestations include decreased levels of ongoing 
genetic instability, decreased frequency of clonal outgrowths, and increased epithe- 
lial growth regulation. 
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THE MEASUREMENT OF CHIROMOSOME INSTABILITY USING 
CHROMOSOME IN SITU HYBRIDIZATION 

Molecular genetic techniques^ while extremely useful for detecting clonal chang- 
es in targets tissues, are somewhat limited in their ability to detect random genetic 
inistability. Conventional cytogenetic assays are useful for detecting chromosome 
instability and clonal chromosome changes. However, they require numbers of 
dividing cells for kaiyotypic analysis that are difficult to attain in the setting of biop- 
sies acquired during the course of a chemoprevention trial. A technique was there- 
fore needed that would allow chromosome instability measurements in situations 
where few cells are available (e.g. small biopsies, brushings, or sputum samples) and 
where the target material might be fixed. It was also desirable to have a technique 
that would be adaptable to tissue sections, whereby spatial information could be 
retained and genotype/phenotype associations could be determined on the same or 
adjacent tissue sections. The technique of in situ hybridization (ISH) involves the 
use of DNA probes that recognize either chromosome-specific repetitive target 
sequences, chromosome single gene copy sequences, or sequences along the whole 
chromosome length or chromosome segments.^^ We have adapted the ISH technique 
for formalin>fixed, paraffin-embedded tissue sections and have applied it to a variety 
of tissues, including the aerodigestive tract^^*^ 

Using probes that label the centromere regions of specific chromosomes, this 
assay permits determination of the average chromosome number per cell for each 
specimen. This assay is also useful for detecting generalized chromosome instability 
during the tumorigenesis process. Nonrial diploid populations should have two cop- 
ies of each autosomal chromosome and should rarely show three or more chromo- 
some copies per cell (chromosome polysomy), especially in tissue sections where 
nuclear truncation results in an under-representation of chromosome copy number. 
Thus, the detection of cells with three or more chromosome copies would indicate 
the presence of chromosomie instability. 

To examine this technique's potential for characterizing the multistep tumorigen- 
esis process in the aerodigestive tract, we measured the fraction of cells exhibiting 
three or more chroinosome copies in apparently contiguous epithelial transitions 
from normal to hyperplastic to dysplastic to carcinomas, all on a single tissue slice 
of head and neck squamous cell carcinomas.^ In these specimens, greater than 35% 
of the cases of adjacent "normal" epithelium, greater than 65% of the cases of hyper- 
plastic epithelium, and greater than 95% of the dysplastic and tumor regions showed 
evidence of chromosome polysomy. Of interest, similar transitions of chromosome 
instability were observed with at least four different chromosome probes. Similar 
trends have also been observed in amenable tissue from other epithelial malignan- 
cies, including cervix, bladder, and breast.^^ These results thus suggested that the 
notions of field cancerization and multistep tumorigenesis might apply to several 
epithelial tissues and that measures of chromosome instability might be useful for 
monitoring this process. 

In the situations described above, the premaHgnant lesions examined might be 
considered to represent epithelium at 100% risk of being in a cancer field, since they 
were located in the adjacent epithelium to the bancer. This then raises the question 
of the nature of genetic instability in the epithelium of individuals at increased risk 
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for developing cancer. To explore this issue, we obtained biopsies during the course 
of leukoplakia chemoprevention trials exploring the use of 13-ciJ-retinoic acid in 
reversing leukoplakia and probed them for genetic instability using in situ hybridiza- 
tion. In one retrospective study and in one prospective study of subjects with oral 
leukoplakia, the results indicate that those subjects whose pretreatment biopsies har- 
bor relatively high levels of genomic instability (ix., naore than 3% of the cells 
examined showing at least 3 chromosome 9 copies per cell) have a significantly 
higher likelihood of suffering early onset of head and neck cancer.^^*^^ Interestingly, 
half of the tumors that did develop occurred away from the biopsy site used to niea- 
sure genetic instability. TMs result suggests that genomic instability measurements 
in carcinogen-exposed tissue can provide useful cancer risk estimates. 



THE RELATIONSEOP BETWEEN TOBACCO EXPOSURE AND 
CHROMOSOME INSTABILITY 

In recent years, the aerodigestive tract chemoprevention group at M.D. Anderson 
Cancer Center has initiated three sequential biomarker-associated chemoprevention 
trials involving chronic smokers with a greater than 20 pack-year smoking history. 
4n-e ai eh-0^esc>studicsrendobgonchi^iQps i e 8 were ob tain e d fr om-siauiefined s it es 
within the lung, including the carina and at bifurcation points at the upper, middle, 
and lower right lung and at the upper and lower left lung. Biopsies were obtained pri- 
or to and following chemopreventive intervention and were subjected to iri situ 
hybridization analysis in addition to analyses for other biomarkers. The first impor- 
tant finding was that some degree of chromosome polysomy was evident in all lung 
sites examined, and this was observed independently of the particular chromosome 
probe utilized.^* This finding supports the notion that random chromosome changes 
may be occurring throughout the exposed lung field. 

In a second study, bronchial biopsies were obtained from individuals with a 20 
pack-year smoking history. In this study, most of the subjects involved were current 
simokers.^^ Interestingly, all cases who showed metaplasia at one of six biopsy sites 
also showed chromosome polysomy in at least one biopsy site; overall, 88% of the 
sites showed some evidence of chromosome 9 polysomy."*^ Evidence for genetic 
instability was also detected in patients who did not show evidence of bronchial 
metaplasia in any of six biopsy sites despite a strong smoking history. In fact, more 
than 90% of the cases and more than 60% of the sites showed significant chromo- 
some polysomy (i.e., at least three copies in at least 2 % of the cells examined). 
These results suggest that the lungs of long-term smokers show significant evidence 
of genetic instability, and this instability can be detected throughout the accessible 
bronchial tree, even when bronchial metaplasia is not evident. 

These studies in current smokers has allowed us to examine the relationship 
between the levels of genetic instability detected and subject characteristics such as 
smoking status (current or former), smoking history, and lung tissue pathologic 
changes. Evaluable biopsy material has now been obtained from more than 108 cur- 
rent smokers, including more than 480 evaluable biopsy sites. The mean metaplasia 
index in these current smokers was 30.4%, F6r the total population studied, the 
median chromosome index for the bronchial biopsies was 1.41 (range, 1.04-1.61) 
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and the median chromosome polysomy index was 2.0% (range 0^.7%). This can be 
compared to a mean chromosome index between 1.2-1.4 for lymphocytes and very 
tare chromosome polysomy. Interestingly* the intrasubject variability in chromo- 
some instability was relatively low in most subjects and was less than the intersub- 
ject variability. These results suggested that chronic smokers harbor detectable 
chromosome instability throughout the accessible bronchial tree (supporting the 
field carcinogenesis notion) and that information from one biopsy site might yield 
representative information for the rest of the lung field. 

Since most of the current smokers exhibited bronchial metaplasia in at least one 
of the biopsied sites, this allowed us to examine the relationship between chromo- 
some instability and histologic changes, both on a site-by-site basis and on a per case 
basis. On a site-by-site basis, the chromosome indices of lesions showing squamous 
metaplasia were similar to those not showing metaplasia (i.e., median 1 .43 vs. 1 .43), 
and the degree of chromosome polysomy in metaplastic lesions were only slightly 
higher than in non-metaplastic sites (medians: 2.2% vs. 1.8%, respectively). Thus, 
the presence or absence of squamous metaplasia at a biopsy site does not necessarily 
correlate with the degree of underlying genomic instability. On the other hand, those 
subjects widi metaplasia indices of at least 15% also showed higher levels of chro- 
mosome polysomy than did subjects with metaplasia index below 15% (medians: 
2.4% vs. 1.8%. p = 0.005V Thus, these chromosome instabi lity assess ments in cur- 
rent smokers appeared to reflect a more global process in the lung field. 

Tobacco exposure has been shown to significantly increase the risk of developing 
lung cancer, and the degree of risk is related to the extent of tobacco exposure. We 
were interested in determining the relationship between individuals' smoking histo- 
ry parameters and the levels of chromosome change found in their lungs following 
years of tobacco exposure. While there was significant intersubject variation for sim- 
ilar tobacco exposure histories, overall there was a significant correlation between 
the degree of chromosome polysomy and the intensity of ongoing tobacco exposure 
(packs/day, p = 0.02 on a per site basis) and with the extent of tobacco exposure 
(pack-years, p = 0.003). Thus the amount of chromosome polysomy reflects the 
intensity and extent of tobacco exposure. At the same time, individuals with similar 
smoking histories showed widely divergent amounts of diromosome polysomy, pos- 
sibly reflecting differences in intrinsic sensitivity between subjects. There was also 
strong correlation between the chromosome index and the duration of the smoking 
history (smoking years) and total accumulated exposure (pack-years, p ~ 0.0001). 
These results suggest that tobacco exposure is associated with the initiation and 
accumulation of chromosome instability in the exposed lung; however individuals 
are differentially sensitive to carcinogenic insult. The working hypothesis is that 
those individuals who accumulate the highest degree of chromosome changes will 
be at the highest lung cancer risk. 

Many of the bronchial biopsies from chronic smokers examined by in situ hybrid- 
ization showed a rise in the chromosome index above tfiat expected for a diploid cell 
population, especially in subjects with an extensive smoking history. The rise in 
chromosome index was also accompanied by an increase in the fraction of cells 
exhibiting at least 3 chromosome copies per cell. To determine if a rise in the tissue 
chromosome index was due to clonal expansion of populations with chromosome tri- 
somy, the chromosome copy number and relative coordinates of each cell scored in 
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tbe bronchial epithelium was recorded and a spatial genetic map was created."*^ We 
then developed algorithms for calculating localized chromosome indices within the 
tissue. Since trisomic clones would have, on average, three chromosomes instead of 
two, those cells involved in neighborhoods with chromosome indices three*halves 
that of diploid populations could be marked as being part of a trisomic clone. Simi- 
larly, groups of cells with chromosome indices half that of diploid populations could 
be marked as being part of a monosomic clone. Tiiis allowed the generation of a sec- 
ond-order, two-dimensional genetic map representation of the bronchial epithelium 
showing the relative locations of cells involved in monosomic and trisomic clonal 
outgrowths. When adjacent tissue sections from the same bronchial biopsy were 
probed separately for different chromosomes, the detected clones appeared to occu- 
py separate subregions of the epithelium. This result suggests that not only are the 
lungs of chronic smokers undergoing a process of genetic instability, they are expe- 
riencing the outgrowth of multiple clones throughout the exposed lung field, as pos- 
tulated by the models shown in Figures 1 and 2. One advantage of this clonal 
approach is that the contribution of both monosomic and multisomic clones can be 
detected. 

Since smoking cessation has been suggested to reduce the lung cancer risk, it was 
of interest to determine whether the levels of chromosome instability would decrease 
following smoking cessation. This question was possible to examine because our 
third sequential chemoprevention trial involved subjects who had discontinued 
smoking. So far, more than 220 subjects (more than 650 biopsies) who have quit 
smoking (mean 9.9 quit-years) have been evaluated for chromosome instability in 
their lungs. Despite the fact that the mean metaplasia index in this group is 5.8% 
(considerably less than that in current smokers), chromosome instability is still 
observed in the majority of subjects."*^ While the mean chromosome polysomy level 
is reduced to 1.0%, soiiie individuals continue to show polysomy levels above 5%. 
Interestingly, while the overall chromosome polysomy levels were reduced in these 
individuals who stopped smoking, the mean chromosome index remained at about 
1 A with some individuals exhibiting chromosome indices as high as 1.8. Initial chro- 
mosome mapping studies suggest that while random chromosome instability seems 
to decrease following smoking cessation, the clonal outgrowths may remain for 
many years in the lung. The working hypothesis is that those individuals who show 
the greatest degree of remaining chromosome instability are at the highest lung can- 
cer risk despite smoking cessation. Long-term follow-up on these subjects will be 
necessary to test this hypothesis. 



SUMMARY AND CONCLUSIONS 

Aerodigestive tract tumorigenesis appears to be a multistep process taking place 
throughout the tissue fields of exposure. When viewed in the context of chromosome 
Changes, carcinogen exposure appears to be associated with the random acquisition 
Of chromosome polysomy throughout the exposed field, the degree of which is relat- 
ed to the degree and extent of carcinogen exposure as well as to the instrinsic suscep- 
tibility of the exposed individual. Continued exposure leads to continued acquisition 
of new changes and, in association with chronic wound-healing processes, to the 
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accumulatioii of clonal outgrowths throughout the target tissue. Although the ulti- 
mate malignancy may occur in only one or few tissue sites, manifestations of the 
instability process that drives tumorigenesis is globally present in the tissue. Thus 
random biopsies may provide useful risk information for the exposed field as a 
whole. Even when carcinogen exposure is reduced or chemopreventive strategies are 
initiated and histologic manifestations of the tumorigenesis process subside, the 
genetic scars of prior exposure remain in the form of clonal outgrowths and may 
explain continued lung cancer risk in ex-smokers. Future chemoprevention strategies 
need to focus on reducing the degree of chromosome instability and on trying to 
eliminate residual abttormal clonal outgrowths in the aerodigestive tract In this set- 
tihg, the measurement of chromosome instability in the target tissue will be useful m 
assessing cancer risk as well as response to intervention. 



The studies reviewed here represent one component of the collaborative efforts 
of the Aerodigestive Tract Chemoprevention team at The University of Texas M.D. 
Anderson Cancer Cent^, Houston, Texas. The studies were supported in part by 
' al-fas titutcs of H ed&4faMon^GafleerInstttute-Grants-QA-52Q547^68^^?r 
CA 79437. CA 16672, CA 68089, CN 25433. CA 86390, CA 70907, NIH DE 13157, 
and the State of Texas Tobacco Research Fund. 
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Abstract 

Early identification and snbsequent intervention are 
fieeded to decrease the lugh mortality rate associated with 
luag cancer. The examination of bronchial epithelium for 
genetic changes could be a yaiuable approach to identify 
individuals at greatest risk. I'he purpose of this 
investigation was to assay cells recovered from 
nonmalignant bronchial epithelium by fluorescence in situ 
hybridization for trisomy of chromosome 7« an alteration 
common in non-small cell lung cancer. Bronchial 
epithelium was collected during bronchoscopy from 16 
cigarette smokers undergoing clinical evaluation for 
possible lung cancer and from seven individuals vnth a 
prior history of underground uranium mining. Normal 
bronchial epithelium was obtained &om individuals 
without a prior history of smoking (never smokers). 
Bronchial cells were collected from a s^mental bronchus 
in up to four different lung lobes for cytology and tissue 
culture. Twelve of 16 smokers were diagnosed with lung 
cancer. Cytologlcal changes found In bronchial epithelium 
included squamous metaplasia, hyperplasia, and atypical 
glandular cells. These changes were present in 33, 12, and 
47% of sites from lung cancer patients* smokers, and 
former uranium miners, respectively. Less than 10% of 
cells recovered from the diagnotlc brush had cytologlcal 
changes^ and in several cases, these changes were present 
within different lobes from the same patient. Background 
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frequencies for trisomy 7 were 1*4 ± 03% in bronchial 
epithelial cells from never smokers. Eighteen of 42 
bronchial sites from lung cancer patients showed 
significantly elevated frequencies of trisomy 7 compared 
to never smoker controls. Six of the sites positive for 
trisomy 7 also contained cytoi<^cal abnormalities. 
Trisomy 7 was found in six of seven patients diagnosed 
with squamous cell carcinoma^ one of one patient with 
adenosquamous cell carcinoma, but in only one of four 
patients with adenocarcinoma. A significant increase in 
trisomy 7 frequency' was detected in cytologically normal 
bronchial epithelium collected from four sites in one 
cancer-free smoker, whereas epithelium from the other 
smokers did not contain this chromosome abnormality, 
finally, trisomy 7 was observed in almost half of the 
former uranium miners; three of seven sites positive for 
trisomy 7 also exhibited hyperplasia. Two of the former 
uranium miners who were positive for trisomy 7 
developed squamous cell carcinoma 2 years after 
collection of bronchial cells. To determine whether the 
increased frequency of trisomy 7 reflects generalized 
aneuploidy or specific chromosomal duplication^ a 
subgroup of samples was evaluated for trisomy of 
chromosome 2{^the frequency was not elevated in any 
of the cases as compared mih controls. The studies 
described in this report are the first to detect and 
quantify the presence of trisomy 7 m subjects at risk for 
lung cancer. These results also demoiKtrate the ability to 
detect genetic changes in cytologically normal cells, 
suggesting that molecular analyses may enhance the 
power for detecting premalignant changes in bronchial 
epithelium in high-risk individuals. 

Introduction 

Although lung cancer is ihe leading cause, of cancer death in the 
United States (1), early deteclion and intervention could de- 
crease the high mortality rate associated with this disease if 
sensitive screening approaches could be developed (2~4). Early 
detection may be feasible because the entire respiratory tract is 
exposed to inhaled carcinogens; therefore* the whole lung is at 
risk for developing muitipie, independently initiated sites. This 
"field cancerization" condition (5) is supported clinically by a 
high iiequency of second primary tumors in lung cancer pa- 
tients (6-9) and by the occurrence of progressive histological 
premalignant chatiges throughoui the lower re.spiratory tract of 
cigarelle smokers (10, il). Moreover* recent .studies using 
pathological tissues obtained after lung resection or autopsy 
have identified geneiic aberralions associated witli lung cancer 
in nonmalignant bronchial epithelium adjacent to tumors (12- 
16). 

Although examination of pathological samples is useful 
for identifying genetic changes associated with carcinogenesis, 
this invasive approach for collection of clinical samples nec- 
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essary for early detection would not be appropriate for screen- 
ing. However, bronchial epilhelial ccUs harvested using routine 
clinical procedures could be examined for genetic changes as an 
initial approach for delecting individuals at high risk for. lung 
cancer. This approach could also provide genetic markers for 
evaluating the effectiveness of chemoprcventiou regimens. 
Bronchoscopy provides direct access lo viable cells within the 
ainvays and is a commonly used tool for obtaining samples 
from the lower respiratory tract, including bronchial epithelium 
(17). This procedure can be used to repeatedly sample the 
bronchial epithelium over itme and to collect viable cells that 
can be expanded through tissue culture for functional assays. 

Because of field cancerization, genetic abnormalities 
should be dispersed throughout the bronchial epithelium of 
pcisons al risk for lung cancer. Hie purpose of this investiga- 
tion was to test this hypothesis by sampling nonmalignant 
bronchial epithelium from distinct locations within four differ- 
ent lobes of the lung from persons at risk for lung cancer and 
then assaying the bronchial cells for the presence of specific 
genetic abnormalities. Trisomy of chromosome 7 was exann- 
ined in these cells, because this alteration is common in solid 
tumors, including lung cancer, of several different organ sys- 
tems (18. 19). in addition, trisomy 7 has been detected in 
premaiignant lesions such as villous adenoma of the colon (20), 
in the colonic mucosa of individuals with familial polyposis 
(21 ), and in the far margins of some. resected lung tumors (22). 
Our results demonstrate that trisomy 7 can be detected in 
nonmalignant bronchial epithelium from patients with lung 
cancer distant to the siie of the tumor and in individuals without 
tumors who arc at high risk for lung cancer development. 
Together, these studies suggest lliat an cxUti copy of chromo- 
some 7 may be an intermediate biomiarker of ongoing field 
carcinogenesis. 

Materials and Methods 

Subject Recruitment. Bronchial epithelium was collected 
from 16 cigarette smokers undergoing a diagnostic workup for 
possible lung cancer and from 7 individuals with a prior history 
of underground uranium mining. 5 of whom were also smokers. 
Three individuals who had never smoked were also recruited to 
obtain bronchial epithelium not exposed directly to either to- 
bacco smoke or radon progeny. 

Pathology and Exposure History, Twelve of the 16 cigarette 
smokers who underwent diagnostic bronchoscopy were diag- 
nosed vdth NSCLC.^ Seven tumors were characterized histo- 
logically as SCCs, four tumors were ACs, and one tumor was 
an adenosquamous'ccH carcinoma. Lung cancer was not evi- 
dent in the other four subjects. Smoking histories ranged from 
15 to 120 pack-years (defined as the number of cigarettes 
smoked per day times the number of years smoked). All of the 
former uranium miners worked underground between 2 and 20 
years, with a range of 27-527 working level months. Five of the 
seven miners had smoking histories that nuiged from 20-60 
pack-years. 

Broncho-scopic Collection and PnKessmg of Bronchial Ep- 
ithelium, A protocol was developed for harvesting viable 
bronchial epithelium from the lower respiratory tract using a 
standard cytology brush during bronchoscopy. After introduc- 



tion into the lower respiratory tract, the bronchoscope was 
directed into each upper and lower lobe, and the carinal margin ' 
of a segmental orifice, usually the second and third bifurcation 
within the tipper and lower lobes, respectively, was brushed. 
These sites were chosen because (a) they arc bigh-depositian 
areas for particles; {b) Ihey arc associated frequenUy with 
histological changes In smokers; and (c) they represent sites 
where tumors commonly occur (11, 23). The area was first 
washed with saline tO remove any nonadherent cells. Sites were 
not hmshed if a tumor was visualr/.ed within 5 cm of the site. 
After brushing, the brush was withdrawn, placed in serom-free 
medium, and kept on ice until proces.sed. Bach site was brushed 
twice. The procedure was well tolerated by aU subjects, and no 
complications were noted related to the brushing procedure. 

Bronchial cells were collected from only two of the sites 
in two of the subjects, from three sites in two subjects, and from 
all four sites in the remaining subjects. Although only two sites 
were brushed initially in case 1, cells were obtained from all 
four sites in this subject during a repeat bronchoscopy per- 
formed after the initial procedure did not yield a diagnosis. 
Samples were obtained from all four sites in the cancer-free 
current smokers and in the never smokers. In addition, bron- 
chial epithelial cells derived at autopsy by Clonetics. Inc. (San 
Diego. CA) from four never smokers were also obtamcd to 
serve as additional controls. Only two sites sampled from most 
of the former m-anium miners were available for analysis be- 
cause cells recovered from the other sites had been used ex- 
clusively for cytology in another investigation.'^ 
Bronchial Epithelial CeU Culture. Replicative cultures of the 
bronchial epithelial cells obtained by the procedure described 
above were established in our laboratory (24) using a scrum- 
free medium (BEGM: Clonetics, Inc.) that is optimal for grov^lh : 
of these cells. Cells were renioved from brushes by vigorous 1 
shaking in BEGM; cells from one brush were prepared for j 
cytologicai analyses, and cells from the other brash were j 
washed, rcsuspended in BEGM, seeded onto 60-mm fibronec- ^ 
tin-coated plates, and grovra at 37*^C in 3% CO^ and 21% O, j 
until 80% confluence. Prior to passage, aliquots of ceils were : 
cryopreserved and stoied at -145^C; other samples of cells ; 
were fixed in methanol-acetic acid (3:1). Next, the cells were , 
washed four to six times in methanol: acetic acid and then ; 
dropped onto slides (about 2 X lO^cells/slide). The effects of 
cell culture on the frequency of urisomy 7 in nonmalignant 
bronchial epithelium were examined by placing cells dispelled 
from brushes directly onto microscope slides followed by fix- 
ation. 

Cytology. Cells from one brush from each bronchial collection 
site were prepared for cviological analysis by smearing the cells 
across a microscope slide. The cells were then fixed wi'th 96% 
ethanol and stained according to the Papanicolaou procedure 
(25) to facilitate morphological evaluation by a cytopaihologist. 
Detection of Trisomy 2 and Trisomy 7. Trisomy 2 and tri- 
somy 7 were determined by hybridization of cells with a bjou- 
nylated chromosome 2 or 7 centromere probe (Oncor; GsitW- 
ersburg, MB). The probes were denatured in hybridizanon 
buifer at for 5 min, and the slides were immersed jr ^.v 
formamide-2X SSPE at 7()^C for 2 min. Tlic probe was then 
applied to the slides, which were incubated in a humiciiiJca 
chamber ai 37*'C for 16 h. After incubation, the slides were 
washed in 0.25X SSPE (10 mM sodium phosphate monobasic 
monohydrate; I mw ethyiencdiamine tetraacetic acid disodium 
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Tflc abbreviations used arc: NSCXC non-smajl cell lung cancer: SCC. squa- 
mous ceil cancer. AC. aticnocarcinoma; HOFR. epidermal growth factor receptor; 
FISH, nuiinrsccnce in situ hybridizaUon; l-OH, loss of heterozygosity; BEGM» 
Bronchial Cpilhelium Growth Medium. 



■* Unpublished data. 
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salt, dihydiate; 150 mM sodium chloride, pH 7.4) for 5 min at 
72*C and the probe, wits delected with fluorescein-labeled 
avidin. Cell nuclei were visualized with propidlnm iodide. 
Pata Analysis* The number of centroineric hybridization sig- 
nals in each cell were evaluated in 400 cells/slide, and the 
frequency of trisomy 7 on each slide was calculated by dividing 
the total number of cells expressing three hybridization signals 
by the total number of cells counted on each .slide. Twenty % 
of the slides were scored by a second person, and frequencies 
for trisomy 7 differed by <0.4%. The total number of sites 
positive for trisomy 7 in subjects with SCC and AC were 
compared using Fisher's exact test 

Results 

Cytology. Squamous metaplasia and atypical glandular cells, 
the only cylological abnormalities observed in lung cancer 
patients, were present in 32% of the samples (Table 1 ). These 
cytological changes were observed in <10% of the cells re- 
covered from the diagnostic brush. Two subjects had three sites 
with cytological abnormalities, and five subjects had no cyto- 
logical abnormalities. No samples contained tumor cells by 
cytology, although one of four sites in five subjects was col- 
lected from the same lobe where a tumor was later diagnosed. 

Two of the 16 sites in smokers without lung cancer were 
cytologically abnormal (both in the same person; Table 2), 
whereas no atypical ceJls were present in the 12 sites from tile 
three never smokers (Table 3). In former uranium miners, 
hyperplasia was present in bronchial cells collected from all 
four sites from one person, and in one site in two additional 
people (Table 2). 

Culttiring of Bronchial Epithelial Cells. Tht efficiency of 
establishing repllcaiive cultures of the cells obtained by bron- 
chial brushing was M)0%. The serum-free medium used for 
those cultures is optimal for growing bronchial epithelial cells 
and does not .support fibroblastic cell rephcation (25). There- 
fore, the cells were uniformly epitheloid in appearance. Growth 
potential was evaluated by passaging cells from all seven of the 
uranium miner cases and cases 1-6 from the lung cancer 
patients. Some of these cultures were maintained for up to nine 
passages (a minimum of 16 population doublings), and many 
underwent 30 divisions before senescence. However, none ex- 
hibited an indefinite population-doubling potential. 
Detection of Trisomy 7 in Nonihalignant Bronchial Epithe- 
lium. Background rates of trisomy 7 were determined by ex- 
amining normal human bronchial epithelial cell lines derived 
from autopsy cases of never smokers and bronchial epithelium 
collected from never smokers during bronchoscopy. In bron- 
chial cell lines (passage 2) from four donors and bronchial 
epithelial cell samples obtained by bronchial brushing from the 
recruited never smokers CFable 3). only 1 .4 ± 0.3% (SD) of the 
cells contained three hybridization signals for chromosome 7 
with values ranging from 1 to 1 .8%. These viUues agree with 
those reported by the manufacturer of the probe. Therefore, 
trisomy 7 frequencies of >2.0% (>2 SD above the mean for 
controls) were considered significantly different from controls. 

Passage 1 or 2 bronchial cells from lung cancer patients 
were examined for trisomy 7. Eighteen of the 42 bronchial sites 
(43%) sampled from the 12 lung cancer patients contained 
trisomy 7 at frwjuencics ranging from 2.3 to 6.0% (Table 1 ; Fig. 
J ). Three subjects (cmses { , 2» and i 1) displayed trisomy 7 in all 
sites collected during bronchoscopy, and in two subjects (cases 
7 and 12), irisomy 7 was found in three of four sites (Table 1). 
Six of the 18 sites positive for trisomy 7 also contained cyto- 
logically abnormal cells. Trisomy 7 wa.s found in six of seven 



Table I Frequency of rrisomy 7 in bruochiaJ epithelial ceils from lung 
patients 



Case 


Age 




Tnmor 


tocstiofi 


diagnosis 


TVicumv 7 

(freouency. %) 


I 


64 


104 


see 


BLL" 


N 












RUL 


AGC 


4.0^ 










RIJJ^ 


N 


3.0^ 










RUL*- 


K 


4.0* 












N 


6.0^ 










ixyu 


SM 


4.3*^ 


2 


69 


26 


SCC 


KUL 


SM 


2.8* 










UX 


SM 


3.3* 










LUL 


N 


3.8* 


3 


65 


J20 


SCC 


RLL 


AGC 


2.0 










RUL 


AGC 


2.3* 










LLL 


AGC 


2.0 


4 


52 


90 


AC 


RLL 


' SM 


IJ 










RUL 


N 


LB 










LLL 


S.M 


1.5 










LUL 


SM 


1.8 


5 


70 


50 


SCC 


RLL 


N 


1.5 










RUL 


N 


1.5 










LLL 


N 


1.5 










LUL 


SM 


1.3 


6 


61 


93 


AC 


RLL 


N 


1.5 










RUL 


N 


1.3 










UX 


N 


2.0 










LUL 


N 


1.5 


7 


58 


40 


SCC 


RLL 


N 


1.8 










RUL 


N 


2.3* 










LLL 


N 


2.5* 










LUL 


N 


2.8* 


H 


59 


120 


AdSCC 


RU. 


N 


. 1.5 










RUL 


N 


2.0, 










LiX 


N 


2.5* 










LUL 


AGC 


2.0 


9 


65 


71 


SCC 


RLL 


SM 


2.0 










RUL 


SM 


2.5* 


10 


63 


45. 


AC 


RLL 


N 


LO 










RUL 


N 


1.8 










LU. 


N 


I.R 










LUL 


N 


1.3 


II 


61 


95 


AC 


LLL 


N 


2.5* 










LUL 


N 


2.8* 


12 


76 


17 


SCC 


RIX 


N 


2.0 










RUL 


N 


2.3* 










TH 


N 


2,3* 










Llff- 


N 


2.3* 



RLL, right lower iobc; RUL, right upper lobe: LLL, left lower lobe; LUL, left 
upper lobe; AGC. atypical glandular cells; SM, squamous metaplasia; N, normal 
cells; AilSCC, a*lenosquamou.<! carcinoma. 
* P < 0.05 as compared to never-sinoker controls. 

Resampled 4 motttbs later. 



palients diagnosed with SCC, whereas only one of four patients 
wth AC displayed trisomy 7 in any site collected at bronchos- 
copy. Case 7, which had histological features of both SCC and 
AC, had one site positive for trisomy 7. The frequency of 
positive trisomy 7 sites in all paiienis with SCC within this 
small sample population was significantly greater than in AC 
patients {H < 0.005). 

The reproducibility of detecting trisomy 7 at sites found to 
be positive for this abnormality was investigated in one patient 
(case 1) who required repeat bronchoscopy for clinical reasons. 
Trisomy 7 was increased similarly in the two sites brushed 
during both procedures, although cytological examination 
showed atypical cells in one site from the first bronchoscopy 
and cytologically normal cells from the same site collected 



TMe 2 of trisomy 7 in b«>nduid q.id.elUl cells r«« c«^bu 
smokers and fonner uranuiro miners 



i3 


81 


15 


n 
u 


RU. 
RUX 
LLL 
LUL 


N 
AGC 
N 

SM 


14 


34 


24 


V 


ULL 
RUL 
LLL 
LUL 


N 
N 
N 


15 


68 


51 


0 


RUL 
LU, 
1 in 


N 
N 
N 
N 


16 


45 


30 


0 


RLL 
RUL 
LLL 
LUL 


N 
N 
N 

IN 


17 


59 


8 


27 


LLL 
LUL 


N 
N 


18 


65 


9 


516 


LUL 
RUL 


N 
N 


19 


64 


30 


235 


LUL 
RLL 


N 
N 


20 


56 


0 


186 


LUL 
RLL 


N 
N 


21 
22 


64 
64 


U 
9 


214 

5'n 


RLL 
LUL 
RLL 


H 
N 
H 


23 


67 


31 


124 


LUL 
RLJ. 
KUL 


H 
}) 
H 
H 



L8 

2.0 

1.3 

1.3 

1.0 

13 

4.0^ 

3.0^ 

4.3* 

3.5'' 

1.3 

1.5 

2.0 

1.8 

3.0* 

3.0* 

1.3 

3.3* 

1.5 

1.0 

2.0 

2.3* 

1.8 

LS 

0.8 

L3 

2.8" 

2.5" 

3.3^' 



lnt«ph«« analysis Of chimiosTHnc 7 mnorn>alhu^^ bfonchial 
cjulhclialccUs 



were cuonted per donor ^ 



\ 



H, hypcrpla.'iia. , ^ , 

" P < 0.05 as compared io ncvcMinokcr controls. 



during the second procedure (Table 1). 'Fhe o*er two s.tes 
Sed during the^ond bronchoscopy alsoshowedelevated 

frequencies of trisomy 7 in this pauent. htt,nchial 
Trisomy 7 was delected in cytologically normal bronchial 
epitheUum collected from four sites in «'-,/=f^J 5) of *e 
cLcer-free smokers (Table 2). Bronchial ^^^^J^J^^f^ 
smoker, did not contain this chromosome «bnomahty In fte 
former uranium miners (cases 17-23) seven of l.-> sites crt 
Sd during bronchoscopy were positive for tnsomy 7. Three 
theSve sites were found in one subject (case 23) and also 
. cIS SLal cell hyperplasia. However the o*er four sam^^ 
pies positive for trisomy 7 showed no cytological abnormality. 
^ Two of the former uranium miners ^^'fJ^J 
developed lung cancer within 2 years of broncbal coll^ 
don sec wa.*diagnosed in ihe right upper lobe of both sub- 
S AS notii in Table 2. boU. cases were positi ve for trisomy 
Ttathe St upper lobe brushing site obtained at the mitiai 
bronchoscopy. . . 

Tissue Cuttare Effeds on Trisomy 7 Expression in Bron- 
cS Epithdium. The effect of tissue culture on tnsomy 7 
frequency was assessed by comparing the frequency of this 
SSme abnormality in freshly i^lated b-nchif^^^^^^ 
lium obtained directly from bronchial brushes ( precuUore )to 
l^e I cells. This comparison was conducted on ce 
Kfron. two different bronchial s«es m three different 
sufaiects r(ca.ses 11 and 16 and donor 7 (never smoker;]. Cul- 
SSd W^es positive for trisomy 7 in case 11 were also 



Donor 



Age 



Bnish 
locanion 



I 



1 

2 
3 
4 
5 



6 

n 

15 
41 
45 



35 



33 



NA'^ 

NA 

NA 

NA 

RLL 

RUL 

LU- 

LUL 

RLL 

RUX 

LLL 

LUL 

RLL 

RUL 

LIX 

LUL 



3J 

23 

13 

2.0 

LO 

03 

1.3 

LO 

LO 

23 

2.0 

L8 

0.5 

0.5 

L2 

LO 



sigoaWccll {%) 


2 


3 


4 


92.0 


13 


3.0 


953 


13 


1.0 


94.7 


1.8 


2.0 


94.H 


1.0 


23 


953 


1.8 


i.7 


9S.3 


LO 


0.2 


963 


LO 


1.2 


963 


1.2 


13 


96.8 


LO 


1.2 


933 


1.7 


23 


94.8 


13 


1.7 


94.2 


1.8 


2.2 


98.2 


0.8 


03 


97-2 


13 


1.0 


96.8 


13 


0.7 


96.0 


L5 


13 



FiH 
bro 
is • 
ficl 



^ Abb«vl.ri«„ « « i»aica.«J in the legend » Table iiNA^. nol applicable. 



nositive in preculiure cells from liie same bronchi^ coUecuon 
!ke whereas sites negative for trisomy 7 in cultured cells from 
cS; TS the never smoker were also negative m precukute 
cells date not shown). Values for trisomy 7 diifered by <0.3?r 
b^ttS pSSre aid cultured cells. The effect of passa^ng 
S^n the frequency of trisomy 7 was also examined m 
ScS c^lls fiSm case 1. Trisomy 7 6eqt«ncy was s.nular m 
cells from passages 1. 4, and 7. 

Freauency of Trisomy 2 in NonmaHgnant Bronchial Epi- 

S Leuploidy lis been detecg^n bronc^al s^^mous 
metaplasia, a likely precursor to SCC (26)- To ft^tennm 
whether the increased frequency of tnsomy 7 det'^^ted m fte 
Zrent study reflects generalized ancuploidy or a specific chro- 
™rdilication,lsubgroupof samples w^^^^^^^ 
trisomy of chi«mosome 2. The frequency of tnsomy 2 !" "^ver 
.mofccrs was 1 5 ± 0.4% (daU not shown). Bronchia cells 
torn elhTrub ects. six of whom had elevated frequcnc.es for 
SmT' were evaluated. The frequency for uisomy of chro- 
mSo^ 2 did not differ from never smokers (Table 4). 

Discussion . . . „j 

Tte studies described in this report are the first to detect a.nd 
^nSy ^increa.^ in trisomy 7 in the airway ce^ ofsubjec« 
S for lung cancer. The presence ot tnsomy > appeared to 
be a sp^ific'chromosome gain and not to genet^jed 
aneupl^dy in these cells. In addition, tnsomy 7 m no"mal.g- 
JiiJSelium from lung cancer patients was associated wi^ 
Sec ti histology, suggesUngthat Pa«ents -A .lus g^^^^^^ 
chance may be at greater risk for developing SCC than other 
SlogS fonns of lung cancer. This .suppos.uon was sup- 
Sd by the fact that two cancer-free former uranmm mmers 
S bronchial cells positive for trisomy 7 "h^n^^J^ ^rea 
SCC Fmally. these results demonstrate the ability to deteU 
Sic Ses in cytologically nonnal cells, suggesung tha 
Secular analyses may enhance the power for detecting 
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Fig. 1. FISH for ciiromosomc 7 in 
bronchial epithelial ceils. Trisomy 7 
is apparent in one ccii from this 
field Magntfication, X530. 




Ilection 

h from 
iculture 
<03% 
staging 
ined in 
.Tiilar in 

Hi EpJ- 

jamous 
termine 
I in the 
ic chro- 
Ued for 
n never 
al ceils 
:ies for 
if chro- 
1. 



2ct and 
ubjects 
ared tti 
irallzcd 
imaii«- 
?d with 
genetic 
n oth^r 
as sup- 
miners 
•eloped 
delect 
ng that 
lecting 



Tabte 4 i-retiucncy oi* irisomy 2 in biTonchial epithelial cclis from lung cancer 
patients, cancer-free smokers, and former uranium miners 



Case 


Tumt>r tliagnwis 


Brush location 


Trisomy 2 
(rrec|iiency. %) 


i 


sec 


KLL" 


1.5 






RUL 


1.8 






LLL 


L8 






LUL 


1.0 


2 


sec 


iXJL 


l.O 






LUL 


LO 


7 


sec 


RUL 


1.5 






RUL 


2.1 






LLL 


1.8 






LLfL 


1.5 


8 


AC 


RiX 


03 






Ria. 


i.5 






LLL 


0.8 


i3 


None 


RLI. 


LO 






RUL 


0.8 






LLL 


1.0 






LLl. 


1.3 


15 


None 


RLL 


1.8 






RUL 


2.0 






LLL 


LO 






LLfL 


1.3 


J9 


None 


LUL 


1.9 






RLL 


0.8 


23 


None 


RLL 


1.5 



" AbbiTviations arc as indicated in legend lo Table J. 



premalignant changes in bnjncbial epithelium in high-risk 
imdividuais. 

Cigareiie smoking and the exposu.re of underground min- 
ers to radon progeny are both well-e.stablished respiratory car- 
cinogens (27, 28), Tobacco .smoke contains numerous muta- 
gens and carcinogens, and radon progeny that have been 
inhaled and deposited on ihe respiratory epithelium release or 



particles capable of damaging DNA (28). Although comparison 
between findings in the cigarette smokers and the former ura- 
nium miners is constrained by the number of participants in the 
two groups, trisoHiy 7 was found in both groups. These results 
are consistent with the synergism between smoking and radon 
progeny, which suggests commonality in the pathways by 
which the two carcinogens cause lung cancer (29). 

The bronchial brushing method used for collecting cells 
from the lower respirator)' tract is rapid (10-12 min total for 
two brushes at four different sites), well tolerated by the patient, 
and permits collection of viable bronchial ceils that can be 
expanded through tissue culture at J 00% efficiency. The sta- 
bility of these cells in culture was evident by the fact that the 
frequency of trisomy 7 did not differ between primary brash 
cells and cells propagated for up to seven passages. Further- 
more, this procedure is amenable to the production of sufficient 
cell numbers (1, X 10'*) at low passage (one or two) to accom- 
modate multiple molecular analyses. Although the media used 
in cuhuring of bronchial epithelial cells did not appear to 
provide a selective growth advantage to cells harboring an 
additional chromosome 7, the modulation of medium supple- 
ments might lead to the establishment of clonal populations of 
premalignant cells. Such cell populations would greatly facil- 
itate the identification of additional early gene changes in 
respirdlory carcinogenesis. 

The detection of trisomy 7 in multiple nonmalignant silcs 
within the bronchial iree supports the iheory of fjeld canccr- 
i/ation (5), which .states that diffu.se exposure of the entire 
respiratory tract to inhaled carcinogens causes the development 
of inuitiple, independently initiated .sites daat can lead to tumor 
development. Although the frequency of Uiis chromosome ab- 
normality was relatively low (2.3-6.0%), these values were 
consistent with the low percentage of cells within each brush 
sample (10%) that exhibited abnorarjal cytology. These results 
are also similar to studies of chromosome gain in patients with 
head and neck cancer where trisomy 7 was detected at frequen- 
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cies of 2, X and 21 % In histologically normal, hyperplastic, and 
dysplastic cells, respectively (30), 

The detection of trisomy 7 in normal, hyperplastic, and 
metaplastic bronchial epitheltum from cancer-free patients ex- 
tends a recent report describing LOH at chromosomes 3p, 5q, 
and 9p in dysplastic premalignant bronchial lesions han ested 
from current and fonner smokers by bronchoscopy (31). The 
inability to detect LOH at these chromosome loci in normal or 
early premalignant epithelium may stetn liom a difference in 
sensitivity between the mediodologies used. The low frequency 
of trisomy 7 and cytologically abnormal cells collected from 
bronchoscopy is consistent with a lack of clonality within the 
brush cells. FISH assays on interphase cells permit screening of 
individual cells, and sensitivity for detection is limited only by 
the number of ceils examined. In contra.st» microsatellile anal- 
yses for LOH cannot detect nonclonal changes but require that 
the chromosome alteration be present in approximately 40- 
50% of the sample (32. 33). 

The role of trisomy 7 in lung cancer development has not 
been elucidated. Increased expression of EGFR, which is lo- 
cated on chromosome 7 (34), is observed In 50-80% of 
NSCLCs (16, 35. 36). EGFR expression appears greater in SCC 
than AC (35, 36) and is amplified in some cell lines derived 
from SCC (37). These findings corroborate our hypothesis that 
acquisition of trisomy 7 in bronchial epithelium could be prog- 
nostic for development of SCC. Moreover, expression of this 
gene is also increased in nonmalignant bronchial epitlielium 
from NSCLC patients (16, 35) and in normal or premalignant 
epithelium adjacent to bead and neck tumors (38). Thus, altered 
expression of EGFR could enable cells that have acquired 
additional genetic changes to proliferate continually and escape 
from terminal differentiation (39). In addition, the c-met onco- 
gene is also locaied on chromosome 7 and is overexpressed in 
NSCIX^s (40, 41). This oncogene encodes a transnierrjbrane 
tyrosine kinase (42) that functions as a receptor for the hepa- 
locyte growth factor (43) and is involved in sustaining the 
growth of NSCLC cells iii culture (44). 

Previous studies have delected mutations in p53 (12, 14, 
35), chromosome losses at 9p2 1 (45) and 3p (46) in preinvasive 
bronchial lesion.s, and simple chromosome rearrangements in 
normal bronchial epithelium from proximal airways (47) of 
lung cancer patients. The prevalence of these genetic changes in 
normal epithelium from persons at risk for lung cancer should 
be quantified by FISH to defme the temporal sequences of 
somatic genetic changes that precede the development of clonal 
lesions in the lung. This information will be invaluable in 
providing biological markers that can qualitatively estimate the 
extent of field cancerization in persons at risk for lung cancer 
and can be used to assess the efficacy of chemoLntervention 
trials. Ultimately, the efficiency for detecting these biological 
markers in bronchial epithelium versus exfoliated epithelial 
cells within sputum must be established to support the use of a 
'*genetic-based" screening approach for individuals at high risk 
for lung cancer. The results of the current investigation have 
identified one potential biomarkcr, trisomy 7, that may be 
useful in early detection and intervention for lung carcino- 
genesis. 
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ABSTRACT Wnt family members are critical to many 
developmental processes, and components of the Wnt signal- 
ing pathway have been linked to tumorigenesis in familial and 
sporadic colon carcinomas. Here we report the identiflcation 
of two genes, WISF-1 and WISP-l, that are up-regulated in the 
mouse mammary epithelial cell line C57MG transformed by 
Wnt-1, but not by Wnt-4. Together with a third related gene, 
WISP-3y these proteins define a subfamily of the connective 
tissue growth factor family. Two distinct systems demon- 
strated WISP induction to be associated with the expression of 
Wnt-1. These included (i) C57MG cells infected with a Wnt-1 
retroviral vector or expressing Wnt-1 under the control of a 
tetracyline repressible promoter, and (ii) Wnt-1 transgenic 
mice. The WISP-1 gene was localized to human chromosome 
8q24.1-8q24.3. WISP-l genomic DNA was amplified in colon 
cancer cell lines and in human colon tumors and its RNA 
overexpressed (2- to > 30-fold) in 84% of the tumors examined 
compared with patient-matched normal mucosa. WISP-3 
mapped to chromosome 6q22-6q23 and also was overex- 
pressed (4- to > 40-fold) in ^% of the colon tumors analyzed. 
In contrast, WISP-2 mapped to human chromosome 20ql2- 
20ql3 and its DNA was amplified, but RNA expression was 
reduced (2- to >30-fold) in 79% of the tumors. These results 
suggest that the WISP genes may be downstream of Wnt-1 
signaling and that aberrant levels of WISP expression in colon 
cancer may play a role in colon tumorigenesis. 



Wnt-1 is a member of an expanding family of cysteine- rich, 
glycosylated signaling proteins that mediate diverse develop- 
mental processes such as the control of cell proliferation, 
adhesion, cell polarity, and the establishment of cell fates (1, 
2). Wnt-1 originally was identified as an oncogene activated by 
the insertion of mouse mammary tumor virus in virus-induced 
mammary adenocarcinomas (3, 4). Although Wnt-1 is not 
expressed in the normal mammary gland, expression of Wnt-1 
in transgenic mice causes mammary tumors (5). 

In mammalian cells, Wnt family members initiate signaling 
by binding to the seven-transmembrane spanning Frizzled 
receptors and recruiting the cytoplasmic protein Dishevelled 
(Dsh) to the cell membrane (1, 2, 6). Dsh then inhibits the 
kinase actWity of the normally constitutively active glycogen 
synthase kinase-3^ (GSK-3P) resulting in an increase in 
j3-catenin levels. Stabilized ^-catenin interacts with' the tran- 
scription factor TCF/Lef 1, forming a complex that appears in 
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the nucleus and binds TCF/Lefl target DNA elements to 
activate transcription (7, 8). Other experiments suggest that 
the adenomatous polyposis coli (APC) tumor suppressor gene 
also plays an important role in Wnt signaling by regulating 
)3-catenin levels (9). APC is phosphorylated by GSK-SjS, binds 
to /3-catenin, and facilitates its degradation. Mutations in 
either APC or j3-catenin have been associated with colon 
carcinomas and melanomas, suggesting these mutations con- 
tribute to the development of these types of cancer, implicating 
the Wnt pathway in tumorigenesis (1). 

Although much has been learned about the Wnt signaling 
pathway over the past several years, only a few of the tran- 
scriptionally activated downstream components activated by 
Wnt have been characterized. Those that have been described 
cannot account for all of the diverse functions attributed to 
Wnt signaling. Among the candidate Wnt target genes are 
those encoding the nodal-related 3 gene, Xnr3, a member of 
the transforming growth factor (TGF)-/3 superfamily, and the 
homeobox genes, engrailed, goosecoidy twin (Xtwn), and siamois 
(2). A recent report also identifies z-myc as a target gene of the 
Wnt signaling pathway (10). 

To identify additional downstream genes in the Wnt signal- 
ing pathway that are relevant to the transformed cell pheno-. 
type, we used a PCR-based cDNA subtraction strategy, sup- 
pression subtractive hybridization (SSH) (11), using RNA 
isolated from C57MG mouse mammary epithelial cells and 
C57MG cells stably transformed by a Wnt-1 retrovirus. Over- 
expression of Wnt-1 in this cell line is sufficient to induce a 
partially transformed phenotype, characterized by elongated 
and refractile cells that lose contact inhibition and form a 
multilayered array (12, 13). We reasoned that genes differen- 
tially expressed between these two cell lines might contribute 
to the transformed phenotype. 

In this paper, we describe the cloning and characterization 
of two genes up-regulated in Wnt-1 transformed cells, WISP-1 
and WISP'2, and a third related gene, WISP-3, The genes 
are members of the CCN family of growth factors, which 
includes connective tissue growth factor (CTGF), Cyr61, and 
noVy a family not previously Hnked to Wnt signaling. 

MATERIALS AND METHODS 

SSH, SSH was performed by using the PCR-Select cDNA 
Subtraction Kit (CLONTECH). Tester double-stranded 

Abbreviations: TGF, transforming growth factor; CTGF, connective 
tissue growth factor; SSH, suppression subtractive hybridization; 
VWC, von Willebrand factor type C module. 
Data deposition: The sequences reported in this paper have been 
deposited in the Genbank database (accession nos. AF100777, 
AF100778. AF100779, AF100780, and AF100781). 
tTo whom reprint requests should be addressed, e-mail; diane@gene. 
com. 
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cDNA was synthesized from 2 /tg of polyCA)**" RNA isolated 
from the C57MG/Wnt-1 cell line and driver cDNA from 2 /xg 
of poly(A)-' RNA from the parent C57MG cells. The sub- 
tracted cDNA library was subcloned into a pGEM-T vector for 
further analysis. 

cDNA Library Screening. Clones encoding full-length 
mouse WISP-J were isolated by screening a AgtlO mouse 
embryo cDNA library (CLONTECH) with a 70-bp probe from 
the original partial clone 568 sequence corresponding to amino 
acids 128-169. Clones encoding full-length human WISP-1 
were isolated by screening AgtlO lung and fetal kidney cDNA 
libraries with the same probe at low stringency. Clones en- 
coding full-length mouse and human WISP-2 were isolated by 
screening a C57MG/Wnt'l or human fetal lung cDNA library 
with a probe corresponding to nucleotides 1463-1512. Full- 
length cDNAs encoding WISP-3 were cloned from human 
bone marrow and fetal kidney libraries. 

Expression of Human WISP RNA. PCR amplification of 
first-strand cDNA was performed with human Multiple Tissue 
cDNA panels (CLONTECH) and 300 /xM of each dNTP at 
94**C for 1 sec, 62**C for 30 sec, 72*C for 1 min, for 22-32 cycles. 
WISP and glyceraldehyde-3-phosphate dehydrogenase primer 
sequences are available on request. 

In Situ Hybridization. ^^P-labe!ed sense and antisense ribo- 
probes were transcribed from an 897-bp PCR product corre- 
sponding to nucleotides 601-1440 of mouse WISP-1 or a 
294-bp PCR product corresponding to nucleotides 82-375 of 
mouse WISP-2. All tissues were processed as described (40). 

Radiation Hybrid Mapping. Genomic DNA from each 
hybrid in the Stanford G3 and Genebridge4 Radiation Hybrid 
Panels (Research Genetics, Huntsville, AL) and human and 
hamster control DNAs were PCR-amplified, and the results 
were submitted to the Stanford or Massachusetts Institute of 
Technology web servers. 

Cell Lines, Tumors, and Mucosa Specimens. Tissue speci- 
mens were obtained from the Department of Pathology (Uni- 
versity of Pittsburgh) for patients undergoing colon resection 
and from the University of Leeds, United Kingdom. Genomic 
DNA was isolated (Qiagen) from the pooled blood of 10 
normal human donors, surgical specimens, and the following 
ATCC human cell lines: SW480, COLO 320DM, HT-29, 
WiDr, and SW403 (colon adenocarcinomas), SW620 (lymph 
node metastasis, colon adenocarcinoma), HCT 116 (colon 
carcinoma), SK-CO-1 (colon adenocarcinoma, ascites), and 
HM7 (a variant of ATCC colon adenocarcinoma cell line LS 
174T). DNA concentration was determined by using Hoechst 
dye 33258 intercalation f luorimetry. Total RNA was prepared 
by homogenization in 7 M GuSCN followed by centrifugation 
over CsCI cushions or prepared by using RNAzol. 

Gene Amplification and RNA Expression Analysis. Relative 
gene amplification and RNA expression of WISPs and c-myc in 
the cell lines, colorectal tumors, and normal mucosa were 
determined by quantitative PCR. Gene-specific primers and 
fluorogenic probes (sequences available on request) were 
designed and used to amplify and quantitate the genes. The 
relative gene copy number was derived by using the formula 
2(^ct) where ACt represents the difference in amplification 
cycles required to detect the WISP genes in peripheral blood 
lymphocyte DNA compared with colon tumor DNA or colon 
tumor RNA compared with normal mucosal RNA. The 
d-method was used for calculation of the SE of the gene copy 
number or RNA expression level. The H75P-specific signal was 
normalized to that of the glyceraIdehyde-3-phosphate dehy- 
drogenase housekeeping gene. All TaqMan assay reagents 
were obtained from Perkin-Elmer Applied Biosystems. 

RESULTS 

Isolation of WISP-l and WISP-2 by SSH. To identify Wnt- 
1 -inducible genes, we used the technique of SSH using the 



mouse mammary epithelial cell line C57MG and C57MG cells 
that stably express Wnt-1 (11). Candidate differentially ex- 
pressed cDNAs (1384 total) were sequenced. Thirty-nine 
percent of the sequences matched known genes or homo- 
logues, 32% matched expressed sequence tags, and 29% had 
no match. To confirm that the transcript was differentially 
expressed, semiquantitative reverse transcription-PCR and 
Northern analysis were performed by using mRNA from the 
C57MG and C57MG/Wnt-1 cells. 

Two of the cDNAs, WISP-1 and WISP'2, were differentially 
expressed, being induced in the C57MG/Wnt-1 cell line, but 
not in the parent C57MG cells or C57MG cells overexpressing 
Wnt-4 (Fig. 1 A and 5). Wnt-4, unlike Wnt-1, does not induce 
the morphological transformation of C57MG cells and has no 
effect on /3-catenin levels (13, 14). Expression of WISP-I was 
up-regulated approximately 3-fold in the C57MG/Wnt-1 cell 
line and WISP-2 by approximately 5-fold by both Northern 
analysis and reverse transcription-PCR. 

An independent, but similar, system was used to examine 
expression after Wnt-1 induction. C57MG cells express- 
ing the Wnt-1 gene under the control of a tetracycline- 
repressible promoter produce low amounts of Wnt-1 in the 
repressed state but show a strong induction of Wnt-1 mRNA 
and protein within 24 hr after tetracycline removal (8). The 
levels of Wnt-1 and WISP RNA isolated from these cells at 
various times after tetracycline removal were assessed by 
quantitative PCR. Strong induction of Wnt-1 mRNA was seen 
as early as 10 hr after tetracycline removal. Induction of WISP 
mRNA (2- to 6-foId) was seen at 48 and 72 hr (data not shown). 
These data support our previous observations that show that 
WISP induction is correlated with Wnt-1 expression. Because 
the induction is slow, occurring after approximately 48 hr, the 
induction of WISPs may be an indirect response to Wnt-1 
signaling. 

cDNA clones of human WISP-1 were isolated and the 
sequence compared with mouse WISP-L The cDNA sequences 
of mouse and human WISP-1 were 1,766 and 2,830 bp in length, 
respectively, and encode proteins of 367 aa, with predicted 
relative molecular masses of «*40,000 {Mr 40 K). Both have 
hydrophobic N-terminal signal sequences, 38 conserved cys- 
teine residues, and four potential N-linked glycosylation sites 
and are 84% identical (Fig. Z4). 

Full-length cDNA clones of mouse and human WISP-2 were 
1,734 and 1,293 bp in length, respectively, and encode proteins 
of 251 and 250 aa, respectively, with predicted relative molec- 
ular masses of ^27,000 {M, 27 K) (Fig. IB), Mouse and human 
WISP-2 are 73% identical. Human WISP-2 has no potential 
N-linked glycosylation sites, and mouse WISP'2 has one at 



C57MG 

Parent Wnt-l Wnl-4' 




Fig. L WISP' I and mSP'2 are induced by Wnt-1, but not Wnt-4, 
expression in C57MG cells. Northern analysis of WISP-l (A) and 
mSP'2 (B) expression in C57MG, C57MG/Wnt-1, and C57MG/ 
Wnt-4 cells. PoIy(A)+ RNA (2 fxg) was subjected to Northern blot 
analysis and hybridized with a 70-bp mouse WISP- 1 -specific probe 
(amino acids 278-300) or a 190-bp IW5/'-2-specific probe (nucleotides 
1438-1627) in the 3' untranslated region. Blots were rehybridized with 
human /3-actin probe. 
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Fig. 2. Encoded amino acid sequence alignment of mouse and 
human WlSP-l {A) and mouse and human WISP'2 (B). The potential 
signal sequence, insulin-like growth factor-binding protein (IGF-BP), 
VWC, thrombospondin (TSP), and C-terminal (CT) domains are 
underlined. 

position 197. WISP'2 has 28 cysteine residues that are con- 
served among the 38 cysteines found in WISP-L 

Identification of WISPS, To search for related proteins, we 
screened expressed sequence tag (EST) databases with the 
WlSP-1 protein sequence and identified several ESTs as 
potentially related sequences. We identified a homologous 
protein that we have called WISP-3. A full-length human 
WISP'3 cDNA of 1,371 bp was isolated corresponding to those 
ESTs that encode a 354-aa protein with a predicted molecular 
mass of 39,293. WISP-3 has two potential N-linked glycosyl- 
ation sites and 36 cysteine residues. An alignment of the three 
human WISP proteins shows that WISP-1 and WISP-3 are the 
most similar (42% identity), whereas WISP-2 has 37% identity 
with WISP-1 and 32% identity with WISP-3 (Fig. 3A). 

WISPs Are Homologous to the CTGF Family of Proteins. 
Human WISP-1, WISP-2, and WISP-3 are novel sequences; 
however, mouse WISP-1 is the same as the recently identified 
Elml gene. Elml is expressed in low, but not high, metastatic 
mouse melanoma cells, and suppresses the in vivo growth and 
metastatic potential of K-1735 mouse melanoma cells (15). 
Human and mouse WISP-2 are homologous to the recently 
described rat gene, rCop-1 (16). Significant homology (36- 
44%) was seen to the CCN family of growth factors. This family 
includes three members, CTGF, Cyr61, and the protoonco- 
gene nov. CTGF is a chemotactic and mitogenic factor for 
fibroblasts that is implicated in wound healing and fibrotic 
disorders and is induced by TGF-/3 (17). Cyr61 is an extracel- 
lular matrix signaling molecule that promotes cell adhesion, 
proliferation, migration, angiogenesis, and tumor growth (18, 
19). nov (nephroblastoma overexpressed) is an immediate 
early gene associated with quiescence and found altered in 
Wilms tumors (20). The proteins of the CCN family share 
functional, but not sequence, similarity to Wnt-1. All are 
secreted, cysteine-rich heparin binding glycoproteins that as- 
sociate with the cell surface and extracellular matrix. 

WISP proteins exhibit the modular architecture of the CCN 
family, characterized by four conserved cysteine-rich domains 
(Fig. 3B) (21). The N- terminal domain, which includes the first 
12 cysteine residues, contains a consensus sequence (GCGC- 
CXXC) conserved in most insulin-like growth factor (IGF)- 



— — ^ — i . Ki>- Ui'OiiWMiii , 

kllBvEHZO^ f fTvcTl y LF1^ F^:> Tj rh|5FlKtrI ^ fATTH^ Q ^^ r- 1. p £\^f \i^^iB 



tji Vive T P i.>:|i.s V ft r f s 



f H r K H A t y \]i\r f tv c c V <; " oDr, q 
}• • - • H G A r. 'ip?^ ' r-c Tfiy ■ a h K oyu s T 3 r 



;s« * pl£» A V 0 it K A 13.-. 5 kOi A V -" S I^pfS 

'T ? vys '5 i. v ^ri!' ? V p r ? p: w all* 



U -i'' BtL'-'-P J* S K J i- ^ a K B A ■" 



1*4 thhh I yfg f Ujf i l 



B. 



(SxOI[IIIXO]n[I3I>SCE0I) *»^« 
TitT-V==OTTn yi HI ^ w«« 



>. S ^ 3 

•illlfil!' 



WISP'1 
WISP>2 
WlSP-3 
G3PPH Q>| 



Adult 



y I u 



DEDBBBBB 

Fetal 



FiG. 3. (A) Encoded amino acid sequence alignment of human 
WISPs. The cysteine residues of WlSP-1 and WlSP-2 that are not 
present in WISP-3 are indicated with a dot. (B) Schematic represen- 
tation of the WISP proteins showing the domain structure and cysteine 
residues (vertical lines). The four cysteine residues in the VWC domain 
that are absent in WlSP-3 are indicated with a dot. (C) Expression of 
WISP mRNA in human tissues. PCR was performed on human 
multiple-tissue cDNA panels (CLONTECH) from the indicated adult 
and fetal tissues. 

binding proteins (BP). This sequence is conserved in WISP-2 
and WISP-3, whereas WISP-1 has a glutamine in the third 
position instead of a glycine. CTGF recently has been shown 
to specifically bind IGF (22) and a truncated nov protein 
lacking the IGF-BP domain is oncogenic (23). The von Wil- 
lebrand factor type C module (VWC), also found in certain 
collagens and mucins, covers the next 10 cysteine residues, and 
is thought to participate in protein complex formation and 
oligomerization (24). The VWC domain of WISP-3 differs 
from all CCN family members described previously, in that it 
contains only six of the 10 cysteine residues (Fig. 3 A and B). 
A short variable region follows the VWC domain. The third 
module, the thrombospondin (TSP) domain is involved in 
binding to sulfated glycoconjugates and contains six cysteine 
residues and a conserved WSxCSxxCG motif first identiiSed in 
thrombospondin (25). The C-terminal (CT) module contain- 
ing the remaining 10 cysteines is thought to be involved in 
dimerization and receptor binding (26). The CT domain is 
present in all CCN family members described to date but is 
absent in WISP-2 (Fig. 3 A and B). The existence of a putative 
signal sequence and the absence of a transmembrane domain 
suggest that WISPs are secreted proteins, an observation 
supported by an analysis of their expression and secretion from 
mammalian cell and baculovirus cultures (data not shown). 

Expression of WISP mRNA in Human Tissues. Tissue- 
specific expression of human WISPs was characterized by PCR 
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analysis on adult and fetal multiple tissue cDNA panels. 
WISP-1 expression was seen in the adult heart, kidney, lung, 
pancreas, placenta, ovary, small intestine, and spleen (Fig. 3C). 
Little or no expression was detected in the brain, liver, skeletal 
muscle, colon, peripheral blood leukocytes, prostate, testis, or 
thymus. WISP-l had a more restricted tissue expression and 
was detected in adult skeletal muscle, colon, ovary, and fetal 
lung. Predominant expression of WISP-S was seen in adult 
kidney and testis and fetal kidney. Lower levels of WISPS 
expression were detected in placenta, ovary, prostate, and 
small intestine. 

Jn Situ Localization of WISP-^l and WTSP'2, Expression of 
IVISP-I and WISP-2 was assessed by in situ hybridization in 
mammary tumors from Wnt-1 transgenic mice. Strong expres- 
sion of WISP'] was observed in stromal fibroblasts lying within 
the fibrovascular tumor stroma (Fig. 4 A-D). However, low- 
level WISP'l expression also was observed focally within tumor 
cells (data not shown). No expression was observed in normal 
breast. Like WlSP-h WISP'2 expression also was seen in the 
tumor stroma in breast tumors from Wnt-l transgenic animals 
(Fig. 4 E-H). However, WISP'2 expression in the stroma was 
in spindle-shaped cells adjacent to capillary vessels, whereas 
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Fig. 4. {A, C, £, and G) Representative hematoxylin/eosin-stained 
images from breast tumors in Wnt-1 transgenic mice. The correspond- 
ing dark-field images showing WJSP-1 expression are shown in B and 
Z). The tumor is a moderately well-differentiated adenocarcinoma 
showing evidence of adenoid cystic change. At low power (A and 5), 
expression of WISP-1 is seen in the delicate branching fibrovascular 
tumor stroma (arrowhead). At higher magnification, expression is seen 
in the stromal(s) fibroblasts (C and D), and tumor cells are negative. 
Focal expression of WISP-1 y however, was observed in tumor cells in 
some areas. Images of WISP-2 expression are shown in E-H. At low 
power (E and F), expression of WISP-2 is seen in cells lying within the 
fibrovascular tumor stroma. At higher magnification, these cells 
appeared to be adjacent to capillary vessels whereas tumor cells are 
negative (G and H). 



the predominant cell type expressing WISP-1 was the stromal 
fibroblasts. 

Chromosome Localization of the WISP Genes. The chro- 
mosomal location of the human WISP genes was determined 
by radiation hybrid mapping panels. WISP-1 is approximately 
3.48 cR from the meiotic marker AFM259xc5 [logarithm of 
odds (lod) score 16.31] on chromosome 8q24.1 to 8q24.3, in the 
same region as the human locus of the novH family member 
(27) and roughly 4 Mbs distal to c-myc (28). Preliminary fine 
mapping indicates that WISP-1 is located near D8S1712 STS. 
WISP-2 is linked to the marker SHGC-33922 (lod = 1,000) on 
chromosome 20ql2-20ql3.1. Human WISPS mapped to chro- 
mosome 6q22-6q23 and is linked to the marker AFM211ze5 
(lod = 1,000). WISPS is approximately 18 Mbs proximal to 
CTGF and 23 Mbs proximal to the human cellular oncogene 
MYB (27, 29). 

Amplification and Aberrant Expression of WISPs in Human 
Colon Tumors. Amplification of protooncogenes is seen in 
many human tumors and has etiological and prognostic sig- 
nificance. For example, in a variety of tumor types, c-myc 
ampli^cation has been associated with malignant progression 
and poor prognosis (30). Because WISP-1 resides in the same 
general chromosomal location (8q24) as c-m>'c, we asked 
whether it was a target of gene amplification, and, if so, 
whether this amplification was independent of the c-myc locus. 
Genomic DNA from human colon cancer cell lines was 
assessed by quantitative PGR and Southern blot analysis. (Fig. 
5 A and B). Both methods detected similar degrees of WISP-1 
amplification. Most cell lines showed significant (2- to 4-fold) 
amplification, with the HT-29 and WiDr cell lines demonstrat- 
ing an 8-fold increase. Significantly, the pattern of amplifica- 
tion observed did not correlate with that observed for c-myc, 
indicating that the c-myc gene is not part of the amplicon that 
involves the WISP-1 locus. 

We next examined whether the WISP genes were amplified 
in a panel of 25 primary human colon adenocarcinomas. The 
relative WISP gene copy number in each colon tumor DNA 
was compared with pooled normal DNA from 10 donors by 
quantitative PGR (Fig. 6). The copy number of WISP-1 and 
WISP-2 was significantly greater than one, approximately 
2-fold for WISP-1 in about 60% of the tumors and 2- to 4-fold 
for WISP-2 in 92% of the tumors (P < 0.001 for each). The 
copy number for WISPS was indistinguishable from one (P = 
0.166). In addition, the copy number of WISP-2 was signifi- 
cantly higher than that of WISP-1 {P < 0.001). 

The levels of WISP transcripts in RNA isolated from .19 
adenocarcinomas and their matched normal mucosa were 
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Fig. 5 . Amplification of WISP-1 genomic DNA in colon cancer cell 
lines. (A) Amplification in cell line DNA was determined by quanti- 
tative PGR. (B) Southern blots containing genomic DNA (10 /tg) 
digested with EcoRl (WISP-1) or Xbal (c-myc) were hybridized with 
a 100-bp human WISP-1 probe (amino acids 186-219) or a human 
c-myc probe (located at bp 1901-2000). The WISP and myc genes are 
detected in normal human genomic DNA after a longer film exposure. 



Cell Biology, Medical Sciences: Pennica et ah 



WISP-l 




1litiiiiiiiiiiiiiii.i,tiii 



Tumor Number 

Fig. 6. Genomic amplification of WJSP genes in human colon 
tumors. The relative gene copy number of the WJSP genes in 25 
adenocarcinomas was assayed by quantitative PCR, by comparing 
DNA from primary human tumors with pooled DNA from 10 healthy 
donors. The data are means ± SEM from one experiment done in 
triplicate. The experiment was repeated at least three times. 

assessed by quantitative PCR (Fig. 7). The level of WISP-l 
RNA present in tumor tissue varied but was significantly 
increased (2- to >25-fold) in 84% (16/19) of the human colon 
tumors examined compared with normal adjacent mucosa. 
Four of 19 tumors showed greater than 10-fold overexpression. 
In contrast, in 79% (15/19) of the tumors examined, WISP-l 
RNA expression was significantly lower in the tumor than the 
mucosa. Similar to WISP-l, WISPS RNA was overexpressed in 
63% (12/19) of the colon tumors compared with the normal 
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Fig. 7. WISP RNA expression in primary human colon tumors 
relative to expression in normal mucosa from the same patient. 
Expression of WISP mRNA in 19 adenocarcinomas was assayed by 
quantitative PCR. The Dukes stage of the tumor is listed under the 
sample number. The data are means ± SEM from one experiment 
done in triplicate. The experiment was repeated at least twice. 
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mucosa. The amount of overexpression of WISPS ranged from 
4- to >40-fold. 

DISCUSSION 

One approach to understanding the molecular basis of cancer 
is to identify differences in gene expression between cancer 
cells and normal cells. Strategies based ort assumptions that 
steady-state mRNA levels will differ between normal and 
malignant cells have been used to clone differentially ex- 
pressed genes (31). We have used a PCR-based selection 
strategy, SSH, to identify genes selectively expressed in 
C57MG mouse mammary epithelial cells transformed by 
Wnt-1. 

Three of the genes isolated, WISP-l, WISP-2, and WISPS, 
are members of the CCN family of growth factors, which 
includes CTGF, Cyr61, and nov, a family not previously linked 
to Wnt signaling. 

Two independent experimental systems demonstrated that 
WISP induction was associated with the expression of Wnt-1. 
The first was C57MG cells infected with a Wnt-1 retroviral 
vector or C57MG cells expressing Wnt-1 under the control of 
a tetracyline-repressible promoter, and the second was in 
Wnt-1 transgenic mice, where breast tissue expresses Wnt-1, 
whereas normal breast tissue does not. No WISP RNA expres- 
sion was detected in mammary tumors induced by polyoma 
virus middle T antigen (data not shown). These data suggest 
a link between Wnt-1 and WISPs in that in these two situations, 
WISP induction was correlated with Wnt-1 expression. 

It is not clear whether the WISPs are directly or indirectly 
induced by the downstream components of the Wnt-1 signaling 
pathway (i.e., p-catenin-TCF-l/Lefl). The increased levels of 
WISP RNA were measured in Wnt-l^transformed cells, hours 
or days after Wnt-1 transformation. Thus, WISP expression 
could result from Wnt-1 signaling directly through i3-catenin 
transcription factor regulation or alternatively through Wnt-1 
signaling turning on a transcription factor, which in turn 
regulates WISPs. 

The WISPs define an additional subfamily of the CCN family 
of growth factors. One striking difference observed in the 
protein sequence of WISP-2 is the absence of a CT domain, 
which is present in CTGF, Cyr61, nov, WISP-l, and WlSP-3. 
This domain is thought to be involved in receptor binding and 
dimerization. Growth factors, such as TGF-p, platelet-derived 
growth factor, and nerve growth factor, which contain a cystine 
knot motif exist as dimers (32). It is tempting to speculate that 
WISP-l and WISP-3 may exist as dimers, whereas WISP-2 
exists as a monomer. If the CT domain is also important for 
receptor binding, WISP-2 may bind its receptor through a 
different region of the molecule than the other CCN family 
members. No specific receptors have been identified for CTGF 
or nov. A recent report has shown that integrin avft serves as 
an adhesion receptor for Cyr61 (33). 

The strong expression of WISP-l and WISP-2 in cells lying 
within the fibrovascular tumor, stroma in breast tumors from 
Wnt-1 transgenic animals is consistent with previous obser- 
vations that transcripts for the related CTGF gene are pri- 
marily expressed in the fibrous stroma of mammary tumors 
(34). Epithelial cells are thought to control the proliferation of 
connective tissue stroma in mammary tumors by a cascade of 
growth factor signals similar to that controlling connective 
tissue formation during wound repair. It has been proposed 
that mammary tumor cells or inflammatory cells at the tumor 
interstitial interface secrete TGF-j31, which is the stimulus for 
stromal proliferation (34). TGF-)31 is secreted by a large 
percentage of malignant breast tumors and may be one of the 
growth factors that stimulates the production of CTGF and 
WISPs in the stroma. 

It was of interest that WISP-l and WISP-2 expression was 
observed in the stromal cells that surrounded the tumor cells 
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(epithelial cells) in the Wnt-1 transgenic mouse sections of 
breast tissue. This finding suggests that paracrine signaling 
could occur in which the stromal cells could supply WISP-1 and 
WISP-2 to regulate tumor cell growth on the WISP extracel- 
lular matrix. Stromal cell-derived factors in the extracellular 
matrix have been postulated to play a role in tumor cell 
migration and proliferation (35). The localization of WISP-1 
and WISP-2 in the stromal cells of breast tumors supports this 
paracrine model. 

An analysis of WISP-l gene amplification and expression in 
human colon tumors showed a correlation between DNA 
amplification and overexpression, whereas overexpression of 
WISP-3 RNA was seen, in the absence of DNA amplification. 
In contrast, WISP'2 DNA was amplified in the colon tumors, 
but its mRNA expression was significantly reduced in the 
majority of tumors compared with the expression in normal 
colonic mucosa from the same patient. The gene for human 
WISP-2 was localized to chromosome 20ql2-20ql3, at a region 
frequently amplified and associated with poor prognosis in 
node negative breast cancer and many colon cancers, suggest- 
ing the existence of one or more oncogenes at this locus 
(36-38). Because the center of the 20ql3 amplicon has not yet 
been identified, it is possible that the apparent amplification 
observed for WISP-2 may be caused by another gene in this 
amplicon. 

A recent manuscript on rCop-l, the rat orthologue of 
describes the loss of expression of this gene after cell 
transformation, suggesting it may be a negative regulator of 
growth in cell lines (16). Although the mechanism by which 
WISP-2 RNA expression is down -regulated during malignant 
transformation is unknown, the reduced expression of WISP-2 
in colon tumors and cell lines suggests that it may function as 
a tumor suppressor. These results show that the WISP genes 
are aberrantly expressed in colon cancer and suggest that their 
altered expression may confer selective growth advantage to 
the tumor. 

Members of the Wnt signaling pathway have been impli- 
cated in the pathogenesis of colon cancer, breast cancer, and 
melanoma, including the tumor suppressor gene adenomatous 
polyposis coli and /3-catenin (39). Mutations in specific regions 
of either gene can cause the stabilization and accumulation of 
cytoplasmic ]3-catenin, which presumably contributes to hu- 
man carcinogenesis through the activation of target genes such 
as the WlSPs. Although the mechanism by which Wnt-1 
transforms cells and induces tumorigenesis is unknown, the 
identification of WISPs as genes that may be regulated down- 
stream of Wnt-1 in C57MG cells suggests they could be 
important mediators of Wnt-1 transformation. The amplifica- 
tion and altered expression patterns of the WISPs in human 
colon tumors may indicate an important role for these genes 
in tumor development. 
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Paul A. Haynes pToteome analvsis: Biological assay or data archive? 
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Daniel FIgeys ^j^j^ j-^view we examine the current stale of proteome analysis. There are 

Ruedi Aebersold ^j^^^^ ^^^^ issues discussed: why it is necessary to study proteomes; how pro- 

r I I teomes can be analyzed with current technology; and how proteome analysis 

Department of Molecular ^^^^ enhance biological research. We conclude that proteome anal- 

Biotechnology, ^"'^^^f^ J ysis is an essential tool in the understanding of regulated biological systems. 

Washington, Seattle, WA, USA Current technology, while still mostly limited to the more abundant proteins, 

enables the use of proteome analysis both to establish databases of proteins 
present, and to perform biological assays involving measurement of multiple 
variables. We believe that the utility of proteome analysis in future biological 
research will continue to be enhanced by further improvements in analytical 
technology. 

Contents resolution two-dimensional gel electrophoresis (2-DE), 

. -j^^ detected in the gel and identified by their amino acid 

1 introduction . , I8&i sequence. The ease, sensitivity and speed with which gel- 
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2.1 Correlation between mRNA and protein developed mass spectrometric techniques have dramati- 
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1 Introduction 

A proteome has been defined as the protein complement 
expressed by the genome of an organism, or, in multicel- 
lular organisms, as the protein complement expressed by a 
tissue or differentiated cell [1], In the most comrhon im- 
plementation of proteome analysis the proteins extracted 
from the cell or tissue analyzed are separated by high 
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2 Rationale for proteome analysis 

The dramatic growth in both the number of genome 
projects and the speed with which genome sequences 
are being determined has generated huge amounts of 
sequence information, for some species even complete 
genomic sequences ([15-17]). The description of the 
state of a biological system by the quantitative measure- 
ment of system components has long been a primary 
objective in molecular biology. With recent technical 
advances including the development of differential dis- 
play-PCR [18], cDNA microarray and DNA chip techno- 
logy [19, 20] and serial analysis of gene expression 
(SAGE) [21, 22), it is now feasible to establish global and 
quantitative mRNA expression maps of cells and tissues, 
in which the sequence of all the genes is known, at a 
speed and sensitivity which is not matched by current 
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protein analysis technology. Given the long-standing 
paradigm m biology that DNA synthesizes RNA which 
synthesizes protein, and the ability to rapidly establish 
global, quantitative mRNA expression maps, the ques- 
tions whidi arise are why technically complex proteome 
projects should be undertaken and what specific types of 
information could be expected from proteome projects 
which cannot be obtained from genomic and transcript 
profiling projects. We see three main reasons for pro- 
teome analysis to become an essential component in the 
comprehensive analysis of biological systems, (i) Protein 
expression levels are not predictable from the mRNA 
expression levels, (ii) proteins are dynamically modified 
and processed in ways which are not necessarily 
apparent from Uie gene sequence, and (iii) proteomes 
are dynamic and reflect the state of a biological system. 

2.1 Correlation between mRNA and protein expression 
levels 

Interpretations of quantitative mRNA expression profiles 
frequently implicitly or explicitly assume that for specific 
genes the transcript levels are indicative of the levels of 
protein expression. As part of an ongoing study in our 
laboratory, we have determined the correlation of expres- 
sion at the mRNA and protein levels for a population of 
selected genes in the yeast Saccharontyces cerevlsiae 
growmg at mid-log phase (S. P. Gygi et aL, submitted for 
publication). mRNA expression levels were calculated 
from pubUshed SAGE frequency tables [22]. Protein 
expression levels were quantified by metabolic radiola- 
beling of the yeast proteins, liquid scintillation counUng 
ot the protein spots separated by high resolution 2-DE 
and mass spectrometric identification of the proteinfs) 
migrating to each spot. Hie selected 80 samples consti- 
tute a relatively homogeneous group with respect to pre- 
dicted half-life and expression level of the protein pro- 
ducts. Ihus far, we have found a general trend but no 
sU;ong correlation between protein and transcript levels 
I). For some genes studied equivalent mRNA trans- 
cript levels translated into protein abundances which 
vaned by more than 50-fold. Similarly, equivalent steady- 
state protein expression levels were maintained by trans- 
cnpt levels varying by as much as 40-fold (S P Gygi 
et al submitted). These results suggests that even for a 
population of genes predicted to be relatively homoge- 
neous with respect to protein half-life and gene expres- 
sion, the protein levels cannot be accurately predicted 
irom the level of the corresponding mRNA transcript 
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are important indicators for the state of the system. The 
type of protein modificaOon and the sites modified at a 
specific cellular state can usually not be determined 
irom the gene sequence alone. 

23 IVoteomes are dynamic and reflect the state of a 
biological system 



2:1 Pititeins are dynamically modified and processed 

In tJie mature, biologically active form many proteins are 
post-translationally modified by glycosylation, phosphor- 
ylation, prenylation. acylation, ubiquitination or one or 
more of many other modifications [231 and many pro- 
teins are only functional if specifically associated or com- 
plexed with other molecules, including DNA, RNA pro- 
teins and oiBanic and inorganic cofactors. Frequently 
modifications are dynamic and reversible and may alter 
me precise three-dimensional structure and the state of 
SHf ih "^-^'^ Collectively, the state of modifica- 
tion of the proteins which constitute a biological system 



A smgle genome can give rise to many qualitatively and 
quantitatively different proteomes. Specific stages of the 
cell cycle ^and states of differentiation, responses to 
growth and nutrient conditions, temperature and stress 
and pathological conditions represent cellular states 
which are characterized by significanUy different pro- 
teomes. The proteome, in principle, also reflects events 
that are under translational and post-translational con- 
trol. It IS therefore expected that proteomics will be able 
to provide the most precise and detailed molecular des- 
cnption of the state of a cell or tissue, provided Uiat the 
external conditions defining the state are carefully deter- 
mined. In answer to the question of whether the study 
ot proteomes is necessaiy for the analysis of biomolec- 
ular systems, it is evident that the analysis of mature pro- 
em products in cells is essential as there are numerous 
levels of control of protein synthesis, degradation 
processing and modification, which are only apparent by 
direct protein analysis. ^ 



3 Description and assessment of current proteome 
analysis technology 

3,1 Ibcfanical requirements of proteome technology 

In biological systems the level of expression as well as 
the states of modification, processing and macro-molec- 
ular association of proteins are controlled and modu- 
lated depending on the state of the system. Comprehen- 
sive analysis of the identity, quantity and state of modifi- 
cation of proteins therefore requires the detection and 
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quantitation of the proteins which constitute the system, 
and analysis of differentially processed forms. There are 
a number of inherent difficulties in protein analysis 
which complicate these tasks. First, proteins cannot be 
amplified. It is possible to produce large amounts of a 
particular protein by over-expression in specific cell sys- 
tems. However, since many proteins are dynamically 
post-translationally modified, they cannot be easily am- 
plified in the form in which they finally function in the 
biological system. It is frequently difficult to purify from 
the native source sufficient amounts of a protein for 
analysis* From a technological point of view this trans- 
lates into the need for high sensitivity analytical tech- 
niques. Second, many proteins are modified and pro- 
cessed post-translationally. Therefore, in addition to the 
protein identity, the structural basis for differentially 
modified isofonms also needs to be determined. The dis- 
tribution of a constant amount of protein over several 
differentially modified isoforms further reduces the 
amount of each species available for analysis. The com- 
plexity and dynamics of post-translational protein edit- 
ing thus significantly complicates proteome studies. 
Third, proteins vary dramatically with respect to their 
solubility in commonly used solvents. There are few, if 
any, solveiit conditions in which all proteins are soluble 
and which are also compatible with protein analysis. This 
makes the development of protein purification methods 
particularly difficult since both protein purification and 
solubility have to be achieved under the same condi- 
tions. Detergents, in particular sodium dodecyl sulfate 
(SDSX are frequently added to aqueous solvents to 
maintain protein solubility. The compatibility with SDS 
is a big advantage of SDS polyacrylamide gel electro- 
phoresis (SDS-PAGE) over other protein separation 
tedmiques. Thus, SDS-PAGE and two-dimensional gel 
electrophoresis, which also uses SDS and other deter- 
gents, are the most general and preferred methods for 
the purification of small amounts of proteins, provided 
that activity does not necessarily need to be maintained. 
Lastly, the number of proteins in a given cell system is 
typically in the thousands. Any attempt to identify and 
categorize all of these must use methods which are as 
rapid as possible to allow completion of the project 
within a reasonable time frame. Therefore, a successful, 
general proteomics technology requires high sensitivity, 
high throughput, the ability to differentiate differentially 
modified proteins, and the ability to quantitatively dis- 
play and analyze all the proteins present in a sample. 

3.2 2-D electrophoresis — mass spectrometry: a common 
implementation of proteome analysis 

The most common currently used implementation of 
proteome analysis tedinology is based on the separation 
of proteins by two-dimensional (lEF/SDS-PAGE) gel 
electrophoresis and their subsequent identification and 
analysis by mass spectrometry (MS) or tandem mass 
spectrometry (MS/MS). In 2-DE, proteins are first separ- 
ated by isoelectric focusing (lEF) and then by SDS- 
PAGE, in the second, perpendicular dimension. Separ- 
ated proteins are visualized at high sensitivity by staining 
or autoradiography, producing two-dimensional arrays of 
proteins. 2-DE gels are, at present, the most commonly 
used means of global display of proteins in complex 



samples. The separation of thousands of proteins has 
been achieved in a single gel [24, 25] and differentially 
modified proteins are frequently separated. Due to the 
compatibility of 2-DE with high concentrations of deter- 
gents, protein denaturants and other additives promoting 
protein solubility, the technique is widely used. 

The second step of this type of proteome analysis is the 
identification and analysis of separated proteins. Individ- 
ual proteins from polyacrylamide gels have traditionally 
been identified using iV-terminal sequencing (26, 27], 
internal peptide sequencing [28, 29], immunoblottihg or 
comigration with known proteins (30). The recent dra- 
matic growth of large-scale geilomic and expressed 
sequence tag (EST) sequence databases has resulted in a 
fundamental change in the way proteins are identified by 
their amino acid sequence. Rather than by the traditional 
methods described above, protein sequences are now fre- 
quently determined by correlating mass spectral or 
tandem mass spectral data of peptides derived from pro- 
teins, with the information contained in sequence data- 
bases [31-33). 

There are a number of alternative approaches to pro- 
teome analysis cunently under development. There is 
considerable interest in developing a proteome analysis 
stragegy which bypasses 2-DE altogether, because it is 
considered a relatively slow and tedious process, and 
because of perceived difficulties in extracting proteins 
from the gel matrix for analysis. However, 2-DE as a 
starting point for proteome analysis has many advan- 
tages compared to other techniques available today. The 
most Significant strengths of the 2-DE-MS approach, 
include the relatively uniform behavior of proteins in 
gels, the ability to quantify spots and the high resolution 
and simultaneous display of hundreds to thousands of 
proteins within a reasonable time frame. 

A sdieniatic diagram of a typical procedure of the identi- 
fication of gel-separated proteins is shown in Fig. 2. Pro- 
tein spots detected in the gel are enzymatically or chemi- 
cally fragmented and the peptide fragments are isolated 
for analysis, as already indicated, most frequently by MS 
or MS/MS. There are numerous protocols for the gener- 
ation of peptide fragments from gel-separated proteins. 
They can be grouped into two categories, digestion in 
the gel slice [28, 34) or digestion after electrotransfer out 
of the gel onto a suitable membrane ([29, 35-37] and 
reviewed in [38]). In most instances either technique is 
applicable and yields good results. The analysis of MS or 
MS/MS data is an important step in the whole process 
because MS instruments can generate an enormous 
amount of information which cannot easily be managed 
manually- Recently, a number of groups have developed 
software systems dedicated to the use of peptide MS 
and MS/MS spectra for the identification of proteins. 
Proteins are identified by correlating the information 
contained in the MS spectra of protein digests or 
MS/MS spectra of individual peptides with data con- 
tained in DNA or protein sequence databases* 

The systems we are currently using in our laboratory are 
based on the separation of the peptides contained in pro- 
tein digests by narrow bore or capillary liquid diromatog- 
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Figure 2. Schematic diagram of a procedure for identification of.gel- 
scparatcd proteins. Peptides can either be separated by a technique 
such as LC or CE, or inruscd as a mixture and sorted in the MS. Data- 
base searching can cither be performed on peptide masses from an 
MS spectrum, peptide fragment masses from CID spectra of peptides, 
or a combination of both. 



raphy [39» 40] or capillary electrophoresis [41], the anal- 
ysis of the separated peptides by electrospray ioniza- 
tion (ESI) MS/MS, and the correlation of the generated 
peptide spectra with sequence databases using the 
SEQUEST program developed at the University of Wash- 
ington [32, 33). The system automatically performs the 
follov/ing operations: a particular peptide ion character- 
ized by its mass-to-charge ratio is selected in the MS out 
of all the peptide ions present in the system at a parti- 
cular time; the selected peptide ion is collided in a colli- 
sion cell with argon (collision-induced dissociation, 
CID) and the masses of the resulting fragment ions are 
determined in the second sector of the tandem MS; this 
experimentally determined CID spectrum is then corre- 
lated with the CID spectra predicted from all the pep- 
tides in a sequence database which have essentially the 
same mass as the peptide selected for CID; this correla- 
tion matches the isolated peptide with a sequence seg- 
ment in a database and thus identifies the protein from 
which the peptide was derived. There are a number of 
alternative programs which use peptide CID spectra for 
protein identification, but we use the SEQUEST system 
because it is currently the most highly automated pro- 
gram and has proven to be successful, versatile and 
robust. 

3J Protein identification by LC-MS/MS, capillaiy 
LC-MS/MS and CE«MS/MS 

It has been demonstrated repeatedly that MS has a very 
high intrinsic sensitivity. For the routine analysis of gel- 
separated proteins at high sensitivity, the most signif- 
icant challenge is the handling of small amounts of 
sample. The crux of the problem is the extraction and 
transferal of peptide mixtures generated by the digestion 
of low nanogram amounts of protein, from gels into the 
MS/MS system without significant loss of sample or 
introduction of unwanted contaminants: We employ 
three different systems for introducing gel-purified sam- 
ples into an MS, depending on the level of sensitivity 



1865 



required. As an approximate guideline, for samples con- 
taining tens of picomoles of peptides, LC-MS/MS is 
most appropriate; for samples containing low picomole 
amounts to high femtomole amounts we use capillary 
LC-MS/MS; and for samples containing femtomoles or 
less, CE-MS/MS is the method of choice. 

3.3.1 LC-MS/MS 

The coupling of an MS to an HPLC system using a 
0.5 mm diameter or bigger reverse phase (RP) column 
has been described in detail [42J. This system has several 
advantages if a large number of samples are to be ana- 
lyzed and all are available in sufficient quantity. The 
LC-MS and database searching program can be run in a 
fully automated mode using an autosampler, thus maxi- 
mizing sample throughput and minimizing the need for 
operator interference. The relatively large column is 
tolerant of high levels of impurities from either gel prep- 
aration or sample matrix. Lastly, if configured with a 
flow-splitter and micro-sprayer [40], analyses can be per- 
formed on a small fraction of the sample (less than 5%) 
while the remainder of the sample is recovered in very 
pure solvents. This latter feature is particularly useful 
when an orthogonal technique is also used to analyze 
peptide fractions, such as scintillation of an introduced 
radiolabel, and this data can be correlated with peptides 
identified by CID spectra. 

3.3.2 Capillaiy LC-MS 

An increase of sensitivity of approximately tenfold can be 
achieved by using a capillary LC system with a 100 Mm ID 
column rather than a 0.5 mm ID column as referred to 
above. Since very low flow rates are required for such 
columns, most reports have used a precolumn flow splitr 
ting system for producing solvent gradients. We have 
recently desribed the design and construction of a novel 
gradient mixing system which enables the formation 
of reproducible gradients at very low flow rates (low 
nL/min) without the need for flow splitting (A. Ducret 
et aL, submitted for publication). Using this capillary 
LC-MS/MS system we were able to identify gel-separat- 
ed proteins if low picomole to high femtomole amounts 
were loaded onto the gel (40]. This system is as yet not 
automated and, like all capillary LC systems, is prone to 
blockage of the columns by microparticulates when ana- 
lyzing gel-separated proteins. 

3J.3 CE-MS/MS 

The highest level of sensitivity for analyzing gel-sep- 
arated proteins can be achieved by using capillary elec- 
trophoresis - mass spectrometry (CE-MS). We have de- 
scribed in the past a solid-phase extraction capillary elec- 
trophoresis (SPE-CE) system which was used with triple 
quadrupole and ion trap ESI-MS/MS systems for the 
identification of proteins at the low femtomole to sub- 
femtomole sensitivity level (43, 44], While this system is 
highly sensitive, its operation is labor-intensive and its 
operation has not been automated. In order to devise an 
analytical system with both the sensitivity of a CE and 
the level of automation of LC, we have constructed 
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Figure 3. Schematic illustration of a 
microfabricated analytical system for C£, 
consisting of a micromachined device, 
coated capiHaiy electroosmotic pump, 
and microelectrospray interface. The 
dimensions of the channels and reservoir 
are as indicated in the text. The channels 
on the device were graphically enhanced 
to make them more visible. Reproduced 
from [45], with permission. 



microfabricated devices for the introduction of samples 
into ESI-MS for high-sensitivity peptide analysis. 

The basic device is a piece of glass into which channels 
of 10-30 in depth and 50-70 \im in diameter are 
etched by using photolithography/etching techniques 
similar to the ones used in the semiconductor industry. 
(A simple device is shown in Fig, 3). The channels are 
connected to an external high voltage power supply [451. 
Samples are manipulated on the device and off the 
device to the MS by applying different potentials to the 
reservoirs. This creates a solvent flow by electroosmotic 
pumping which can be redirected by changing the posi- 
tion of the electrode. Therefore, without the need for 
valves or gates and without any external pumping, the 
flow can be redirected by simply switching the position 
of the electrodes on the device. The direction and rate of 
the flow can be modulated by the size and the polarity 
of the electric field applied and also by the charge state 
of the suiface. 

The type of data generated by the system is illustrated in 
whidi shows the mass spectrum of a peptide sample 
representing the tryptic digest of carbonic anhydrase at 
290 fmol/jiL. Each numbered peak indicates a peptide suc- 
cessfully identified as being derived from carbonic an- 



hydrase. Some of the unassigned signals may be chemical 
or peptide contaminants. The MS is programmed to auto- 
matically selejct each peak and subject the peptide to CID. 
The resulting CID spectra are then used to identify the 
protein by correlation with sequence databases. Therefore, 
this system allows us to concurrently apply a number of 
protein digests onto the device, to sequentially mobilize 
the samples, to automatically generate CID spectra of 
selected peptide ions and to search sequence databases 
for protein identification. These steps are performed auto- 
matically without the need for user input and proteins can 
be identified at very low femtomole level sensitivity at a 
rate of approximately one protein per IS min. 

3.4 Assessment of proteome technology 

Using a combination of the analytical techniques de- 
scribed above we have identified the 80 protein spots 
indicated in Fig. 5. The protein pattern was generated by 
separating a total of 40 microgram of protein contained 
in a total cell lysate of the yeast strain YPH499 by high 
resolution 2-DE and silver staining of the separated pro- 
teins. To estimate how far this type of proteome analysis 
can penetrate towards the identification of low abun- 
dance proteins, we have calculated the codon bias of the 
genes encoding the respective proteins. Codon bias is a 
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Figure 4, MS spectrum of a tryptic digest 
of carbonic anhydrase using the microfa- 
bricated system shown in Fig. 3. 290 
fmol/ML of carbonic anhydrase tryptic 
digest was infused into a Finnigan LCQ 
ion trap MS. Each peak was selected for 
CID, and those which were identified as 
containing peptides derived from car- 
bonic anhydrase are numbered. Repro- 
duced from (45)» with permission. 
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calculated measure of the degree of redundancy of trip- 
let DNA codons used to produce each amino acid in a 
particular gene sequence. It has been shown to be a 
useful indicator of the level of the protein product of a 
particular gene sequence present in a cell [46]. The gen- 
eral rule which applies is that the higher the value of the 
codon bias calculated for a gene, the more abundant the 
protein product of that gene becomes. The calculated 
codon bias values corresponding to the proteins identi- 
fled in Fig. 5 are shown in Fig. 6b. Nearly all of the pro- 
teins identified p> 95%) have codon bias values of > 0.2, 
indicating they are highly abundant in cells. In contrast,' 
codon bias values calculated for the entire yeast genome 
(Fig. 6a) show that the majority of proteins present in 
the proteome have a codon bias of < 0.2 and are thus of 
low abundance. 

This finding is of considerable importance in our assess- 
ment of the current status of proteome analysis technol- 
ogy. It is clear that even using highly sensitive analytical 
techniques, we are only able to visualize and identify the 



more abundant proteins. Since many important regula- 
tory proteins are present only at low abundance, these 
would not be amenable to analysis using such tech- 
niques. This situation would be exacerbated in the anal- 
ysis of proteomes containing many more proteins than 
the approximately 6000 gene products present in yeast 
cells II6J. In the analysis of, for example, the proteome 
of any human cells, there are potentially 50000-100000 
gene products (47]. Inherent limitations on the amount 
of protein that can be loaded on 2-'DE, and the number 
of components that can be resolved, indicate that only 
the most highly abundant fraction of the many gene 
products could be successfully analyzed. One approach 
that has been employed to circumvent these limitations 
is the use of very narrow range immobilized pH gradient 
strips for the first-dimension separation of 2-DE f48]. 
Since only those proteins which focus within the narrow 
range will enter the second dimension of separation, a 
much higher sample loading within the desired range is 
possible. This, in turn, can lead to the visualization and 
identification of less abundant proteins. 
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Figure 6. Calculated codon bias values for yeast proteins. (A) Distribu- 
tion of calculated values for the entire yeast proteome. (B) Distribu- 
tion of calculated values for the subset of 80 identified proteins also 
shown in Figs. 1 and 5. Further details of experimental procedures are 
included in S, P. Gygi et at, (submitted). 




4 Utility of proteome analysis for biological 
research 

For the success of proteomics as a mainstream approach 
to the analysis of biological systems it is essential to 
define how proteome analysis and biological researdi 
projects intersect. Without a clear plan for the implemen- 
tation of proteome-type approaches into biological re- 
search projects the full impact of the technology can not 
be realized. The literature indicates that proteome anal- 
ysis is used both as a database/data archive, and as a bio- 
logical assay or biological researdi tool. 

4.1 The proteome as a database 

The use of proteomics as a database or data archive 
essentially entails an attempt to identify all the proteins 
in a cell or species and to annotate each protein with the 
known biological information that is relevant for each 
protein. The level of annotation can, of course, be exten- 
sive. The most common implementation of this idea is 
the separation of proteins by high resolution the 
identification of each detected protein spot and the 
annotation of the protein spots in a 2-DE gel database 
format. This approach is complicated by the fact that it is 
difficult to precisely define a proteome and to decide 
which proteome should be represented in the database. 
In contrast to the genome of a species, which is essen- 
tially static, the proteome is highly dynamic. Processes 
such as differentiation, cell activation and disease can all 
significantly change the proteome of a species. This is 
illustrated in Fig. 7. The figure shows two high-resolu- 



tion 2-DE maps of proteins isolated from rat serum. 
Fig. 7A is from the serum of normal rats, while Fig. 7B 
is from the serum of rats in acute-phase serum after 
prior treatment with an inflammation-causing agent [49]. 
It is obvious that the protein patterns arc significantly 
different in several areas, raising the question of exactly 
which proteome is being described. 

Therefore, a comprehensive proteome database of a spe- 
cies or cell type needs to contain all of the parameters 
which describe the state and the type of the cells from 
which the proteins were extracted as well as the software 
tools to search the database with queries which reflect 
the dynamics of biological systems. A comprehensive 
proteome database should be capable of quantitatively 
describing the fate of each protein if specific systems 
and pathways are activated in the cell. Specifically, the 
quantity, the degree of modification, the subcellular loca- 
tion and the nature of molecules specifically interacting 
with a protein as well as the rate of change of these 
variables should be described. Using these admittedly 
stringent criteria, there is currently no comlete proteome 
database. A number of such databases are, however, in 
the process of being constructed. The most advanced 
among them, in our opinion, are the yeast protein data- 
base YPD [50] (accessible at http://www.ypd.com) and 
the human 2D-PAGE databases of the Danish Centre 
for Human Genome Research [12] (accessible at http:// 
biobase.dk/cgi-bin/celis). While neither can be con- 
sidered complete as not all of the potential gene pro- 
ducts are identified, both contain extensive annotation 
of siipplemental information for many of the spots 
which are- positively identified in reference samples. 

4.2 The proteome as a biological assay 

The use of proteome analysis as a biological assay or 
research tool represents an alternative approach to inte- 
grating biology with proteomics. To investigate the state 
of a system, samples are subjected to a specific proceess 
that allows the quantitative or qualitative measurement 
of some of the variables which describe the system. In 
typical biochemical assays one variable (e.g,^ enzyme 
activity) of a single component {e.g., a particular en- 
zyme) is measured. Using proteomics as an assay, mul- 
tiple variables (e.g., expression level, rate of synthesis, 
phosphorylation state, etc.) are measured concurrently 
on many (ideally all) of the proteins in a saniple. The 
use of proteomics as an assay is a less far-reaching prop- 
osition than the construction of a comprehensive pro- 
teome database. It does, however, represent a pragmatic 
approach whidi can be adapted to investigate specific 
systems and pathways^ as long as the interpretation of 
the results takes into account that with current technol- 
ogy not all of the variables which describe the system 
can be observed (see Section 3.4). 

A common implementation of proteome analysis as a 
biological assay is when a 2-DE protein pattern gener- 
ated from the analysis of an experimental sample is 
compared to an array of reference patterns representing 
different states of the system under investigation. The 
state of the experimental system at the time the sample 
was generated is therefore determined by the quantita- 
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tive comparative analysis of hundreds to a few thousand 
proteins. Comparative analysis of the 2-DE patterns fur- 
thermore highlights quantitative and qualitative differ* 
ences in the protein profiles which correlate with the 
state of the system. For this type of analysis it is not 
essential that all the proteins are identified or even visu- 



alized, although the results become more informative as 
more proteins are compared. It is obvious, however, that 
the possibility to identify any protein deemed character- 
istic for a particular state dramatically enhances this 
approach by opening up new avenues for experimenta- 
tion. 




Figure 7. High resolution 2-DE map of proieins isolated from rat serum with or without prior exposure to an inflam- 
mation-causing agent. (A) normal rat serum, (B) acute-phase serum from rats which had previously been exposed to 
an inflammation-causing agent. The first dimension of separation is an IPG from pH 4-10, and the second dimen- 
sion is a 7.5-1 7.5%T gradient SDS-PAGE gel. Proteins were visualized by staining with amide black. Further details 
of experimental procedures are included in (14, 49). 
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Proteome analysis as a biological assay has been success- 
fully used in the field of toxicology, to characterize 
disease states or to study differential activation of cells. 
The approach is limited, of course, by the fact that only 
the visible protein spots are included in the assay, and it 
is well known that a substantial but far from complete 
fraction of cellular proteins are detected if a total cell 
lysate is separated by 2-DE. Proteins may not be 
detected in 2-DE gels because they are not abundant 
enough to be visualized by the detection method used, 
because they do not migrate within the boundaries (size, 
pi) resolved by the gel, because they are not soluble 
under the conditions used, or for other reasons. 

A different way to use proteome analysis as a biological 
assay to define the state of a biological system is to take 
advantage of the wealth of information contained in 
2-DE protein patterns. 2-DE is referred to as two-dimen- 
sional because of the electrophoretic mobility and the 
isoelectric points which define the position of each pro- 
tein in a 2-DE pattern. In addition to the two dimen- 
sions used to generatlB the protein patterns, a number of 
additional data dimensions are contained in the protein 
patterns. Some of these dimensions such as protein 
expression level, phosphorylation state, subcellular loca- 
tion, association with other proteins, rate of synthesis or 
degradation indicate the activity state of a protein or a 
biological system. Comparative analysis of 2-DE protein 
patterns representing different states is therefore ideally 
suited for the detection, identification and analysis of 
suitable markers. Once again it must be emphasized that 
in this type of experiment only a fraction of the cellular 
proteins is analyzed. Since many regulatory proteins are 
of low abundance, this limitation is a concern, particu- 
larly in cases in which regulatory pathways are being 
investigated. 

5 Concluding remarks 

In this report we have addressed three main issues 
related to proteome analysis. First, we have discussed 
the rationale for studying proteomes. Second, we have 
assessed the technical feasibility of analyzing proteomes 
and described current proteome technology, and third, 
we have analyzed the utility of proteome analysis for bio- 
logical research. It is apparent that proteome analysis is 
an essential tool in the analysis of biological systems. 
The multi-level control of protein synthesis and degrada- 
tion in cells means that only the direct analysis of 
mature protein products can reveal their correct identi- 
ties, their relevant state of modification and/or associa- 
tion and their amounts. Recently developed methods 
have enabled the identification of proteins at ever- 
increasing sensitivity levels and at a high level of auto- 
mation of the analytical processes. A number of tech- 
nical challenges, however, remain. While it is currently 
possible to identify essentially any protein spots that can 
be visualized by common staining methods, it is ap- 
parent that without prior enrichment only a relatively 
small and highly selected population of long-lived, 
highly expressed proteins is observed. There are many 
more proteins in a given cell which are not visualized by 
such methods. Frequently it is the low abundance pro- 
teins that execute key regulatory functions. 



We have outlined the two principal ways proteome anal- 
ysis is currently being used to intersect with biological 
research projects: the proteome as a database or data 
archive and proteome analysis as a biological assay. Both 
approaches have in common that at present they are con- 
ceptually and technically limited. Current proteome data- 
bases typically are limited to one cell type and one slate 
of a cell and therefore do not account for the dynamics 
of biological systems. The use of proteome analysis as a 
biological assay can provide a wealth of information, but 
it is hmited to the proteins detected and is therefore not . 
truly proteome-wide. These limitations in protcomics are 
to a large extent a reflection of the fact that proteins ir> 
their fully processed form cannot easily be amplified and 
are therefore difficult to isolate in amounts sufficient for 
analysis or experimentation. The fact that to date no 
complete proteome has been described further attests to 
these difficulties. With continued rapid progress in pro- 
tein analysis technology, however, we anticipate that the 
goal of complete proteome analysis will eventually 
become attainable. 
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High-throughput technologies, such as proteomic screening and DMA micro-arrays, produce vast 
amounts of data requiring comprehensive analytical methods to decipher the biologically relevant 
results. One approach would be to manually search the biomedical literature; however, this would be 
an arduous task. We developed an automated literature-mining tool, termed MedGene, which 
comprehensively summarizes and estimates the relative strengths of all human gene-disease 
relationships in Medline. Using MedGene. we analyzed a novel micro-array expression dataset 
comparing breast cancer and normal breast tissue in the context of existing knowledge. We found no 
correlation between the strength of the literature association and the magnitude of the difference in 
expression level when considering changes as high as 5-fold; however, a significant correlation was 
observed (r = 0.41; p = 0.05) among genes showing an expression difference of 10-fold or more. 
Interestingly^ this only held true for estrogen receptor (ER) positive tumors, not ER negative. MedGene 
Identified a set of relaUvely understudied, yet highly expressed genes In ER negaUve tumors worthy of 
further examination. 

Keywords: bioinformatics • micro-array • text mining • gerie-disease association • breast cancer 



Introduction 

At its current pace, the accumulation of biomedical literature 
outpaces the ability of most researchers and clinicians to stay 
abreast of their own immediate fields, let alone cover a broader 
range of topics. For example, to follow a single disease, e.g., 
breast cancer, a researcher wpuld liaye had to scan 130 different 
Journals and read 27 papers per day in fd^STThis problem is 
accentuated with high-throughput technologies such as DNA 
mlcro-arrays and proteomics, which require the analysis of 
large datascts involving thousands of genes, many of which are 
unfamiliar to a particular researcher. In any microarray experi- 
ment, thousands of genes may demonstrate statistically sig- 
nificant expression changes, but only a fraction of these may 
be relevant to the study. The ability to interpret these datasets 
would be enhanced if they could be compared to a compre- 
hensive summary of what is known about all genes. Thus, tliere 
is a need to summarize existing l^nowiedge in a format that 
allows for the rapid analysis of associations between genes and 
diseases or other specific biological concepts. 

One solution to this problem is to compile structured digital 
resources, such as the Breast Cancer Gene Database* and the 
Tumor Gene Database.* However, as these resources are hand- 
curated. the labor-intensive review process becomes a rate- 
limiting step In the growth of the database. As a result, these 
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databases have a limited scale and the genes are not selected 
in a systematic fashion. 

An alternative approach is automated text mining; a method 
which involves automated information extraction by searching 
documents for text strings and analysing their frequency and 
context This approach has been used successfully in. seVeral 
instances for biological applicaUors. In most cases, it has. been . 
applied to extract information about the relationships or 
interactions that proteins or genes have with one another, in 
the literature or by functional annotation.''' Thus far. few 
publication have applied text-mining to examine the global 
relationships between genes and diseases. Perez-Iratxeta et al. 
automatically examined the GO (Gene Ontology) annotation 
of genes and their predicted chromosomal locations in order 
to Identify genes linked to inherited disorders.' 

To obtain a more global understanding of disease develop- 
ment, it would be valuable to incorporate information regarding 
all possible gene-disease relationships. Including biochemical, 
physiological, pharmacological, epidemiological, as well as 
genetic. This information would enable comprehensive com- 
parisons between large experimental datasets and existing 
knowledge in the literature. This would accomplish two things. 
First, it would serve to validate experiments by demonstrating 
that known responses occur as predicted. Second, it would 
rapidly highlight which genes are corroborated by the literature 
and which genes are novel in a given context. We have utilized 
a computational approach to literature mining to produce a 
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comprehensive set of gene-disease relationships. In addition, 
wre have developed a novel approach to assess the strength of 
each association based on the frequency of citation and co- 
citation. We applied this toot to help interpret the data from a 
large micro-array gene expression experiment comparing 
normal and cancerous breast tissue. 

Methods 

MedGene Database. MedGene is a relational database, stor- 
ing disease and gene information from NC6I. text mining re- 
sults, statistical scores, and hyperlinks to the primary lit- 
erature. MedGene has a web-based user interface for users to 
queiy the database (http://hipseq.med.harvard.edu/MedGene/). 

Text Mining Algorithms. MeSH files were downloaded from 
the MeSH web site at NLM (NaUon Ubrary of Medicine) (http:// 
www.nlm.nih.gov/mesh/meshhome.htmQ and human disease 
categories were selected. LocusUnk files were downloaded from 
the LocusUnk web site at NCBI (httpy/www.ncbi.nih.gov/ 
LocusUnk/). Official/preferred gene symbol, ofTicial/preferred 
gene name, and gene alternative symbols and names, all 
relevant annotations and URLs for each LocusLink record, were 
collected. Gene search terms were used for literature searching 
and included all qualified gene names, gene symbols, and gene 
family terms. Primary gene keys, predominantly qualified gene 
family terms and gene official/preferred symbols, were used 
to index Medline records. If the official/preferred gene symbols 
did not meet the standards to be an index, then qualified gene 
official/preferred names were used. A local copy of Medline 
records (up to July, 2002) was pre-selected. 

A JAVA module examined the MeSH terms and then indexed 
each Medline record with the appropriate disease terms. A 
separate JAVA module was used to examine the titles and 
abstracts for gene search terms and then to index the gene- 
related Medline records witli the relevant primary gene key(s). 

Statistical Methods, For every gene and disease pair, we 
counted records that were indexed for both gene and disease 
(double positive hits), for disease only (disease single hits), for 
gene only (gene single hits), and for neither gene nor disease 
(double negative hits) to generate a 2 x 2 contingency table. 
On the basis of the contingency table-framework, we applied; 
different statistical methods to estimate the strength of gene- 
disease relationships and evaluated the results. These methods 
included chl-square analysis. Fisher s exact probabilities, rela- 
tive risk of gene, and relative risk of disease'^ (http:// 
hipseq.med.harvard.edu/MedGene/). In addition, we computed 
the "product of frequency", which is the product of the 
proportion of disease/gene double hits to disease single hits 
and the proportion of disease/gene double hits to gene single 
hits. To obtain a normal distribution, we transformed all the 
statistical scores using the natural logarithm. We selected the 
log of the product of frequency (LPF) to validate MedGene and 
to use for the analysis with the micro-array data. Spearman 
rank-correlation coefilcients were used to assess the linear 
relationship between LPF and micro-array fold change in 
expression level. 

Global Analysis. Diseases with at least 50 related genes were 
selected for clustering analysis, and (he LPF scores were 
normalized with total score for each disease. Hierarchical 
clustering was done with the "Cluster" software and the 
clustering result was visualized using "Tree Viewer" (http:// 
rana.lbl.gov/EisenSoftware.htm). 
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Breast Tissue Micro-Arrays. Elghty-nine breast cancer 
samples (79% ER-positive) and 7 normal breast tissue samples 
were selected from the Harvard Breast SPORE frozen tissue 
repository and were representative of the spectrum of histo- 
logical types, grades, and hormone receptor Immuno-pheno- 
types of l^reast cancer. Biotinylated cRN A, generated from the 
total RNA exu-acted fn3m the bulk tumor, was hybridized to 
Affymetrix U95A oligo-nucleotlde micro-arrays. These micro- 
arrays consist of 12 400 probes, which represent approximately 
9000 genes. Raw expression values were obtained using GENE- 
CHIP software from Anymetrix. and then further analyzed using 
the DNA-Chip Analyzer (dChip) custom software. 

Results 

Automated Indexing of Medline Records by Disease and 
Gene. To study the gene>disease associations in the literature, 
we first compiled complete lists for human diseases and human 
genes. To index all Medline records that were relevant to 
human diseases, the Medical Subject Heading (MeSH) index 
of Medline records was utilized. MeSH is a controlled medical 
vocabubiy from the National Library of Medicine and consists 
of a set of terms or subject headings that are arranged in both 
an alphabetic and an hierarchical structure. Medline records 
are reviewed manually and MeSH terms are added to each with 
software assistance.* ** TWenty-three human disease category 
headings along with all of their child terms (see the Supporting 
Information. Supplemental Table 1. or visit http^/hipseq. 
med.harvard.edu/MedGene/publication/s^Table l.html) were 
selected from the 2002. MeSH index creadng a list of 4033 
human diseases. 

No index comparable to the MeSH index exists for genes, 
and thus, it was necessary to apply a string search algorithm 
for gene names or symbols found in Medline text. A complete 
list of genes, gene names, gene symbols, and frequently used 
synonyms were collected from the LocusUnk database at 
NCBI." '* which contains 53 259 independent records keyed 
by an ofnclal gene symbol or name Oune 18*. 2002). For the 
purposes of this study, no distinction was made between genes 
and their gene products. Authors often use the same name for 
both, differentiating the two only by the use of italics. If at all. 
For the intended use of dils .study, this lack of dlsUncUon is 
tmlikely to have a laige effect and may in fact be beneilcial. 

Initial attempts to search the literature using these lists 
revealed several sources of false positives and false negatives 
(Table I). False positives primarily arose when the searched 
term had other meanings, whereas false negatives arose from 
syntax discrepancies necessitating the development of filters 
to reduce these errors. The syntax issues were readily handled 
by including alternate syntax forms in the search terms. The 
false posiUve cases, caused by duplicative and unrelated 
meanings for the terms, were more dimcult to manage. Where 
possible, case sensitive string mapping reduced inappropriate 
citations. In many cases, however, this was not sufficient and 
die terms had to be eliminated entirely, thereby reducing the 
false positive rate but unavoidably under-representing some 
genes. 

For the purposes of data tracking, a primary gene key was 
selected to represent all synonyms that correspond to each 
gene. Medline records were indexed with a primary gene key 
when any synonym for that key was found In the title or 
abstract. Case-insensitive string mapping was used for ail 
searches except as noted above. No additional weight was 
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Table 1. Systematic Sources of False Positives and False Negatives in Unfiltered Data' 



source of error 



error type 



example 



gene symbol/name false positive 

Is not unique 



gene symbol is felse positive 

unrelated abbreviation 

gene symbol/name false positive 

has language meaning 

nonstandard syntax false negative 

unofficial gene name/symbol false negative 

nonspecified gene name false negative 



MAC-mye\in 

associated glycoprotein 
MA C?-maIlgnancy-associated 

protein 

iM-pallid homologue (mouse). 

pallidin (also abbrev. for Pennsylvania) 
MC45-Wlskott-Aldrlch Syndrome 

(also the word "was") 
B>4C'/ instead oi BAGl 
/>5i instead or TP53 
estrogen receptor instead of 

Estrogen receptor I 



niter solution 



eliminate this term 

eliminate this term 

case-sensitive string search 

add dash term 

add all gene nicknames 

add family stem term 



• In preliminary studies, l^edline was seaiched for co-occurrence of genes and diseases and the rcsuiUng output was evaluated to tdeolify ciror sources that 
were amenable to global filters. Each error souice Is categorized by the type of error It causes: false poslUvcs are suggested rcbUonshlps that are not real and 
false negaUvcs are real relationships that are underrepreserued. The (Uter solutions used are Indicated. Note that in some cases, the flltcr solution itself introduces 
error. In general, error rates maximized sensitivity, even at the expense of specificity If needed. 



added for multiple occurrences of a term or the co-occurrence 
of multiple synonyms for the same gene key. 

Medline records were searched with ail qualifled gene 
identifiers, such as the oHlclal/preferred gene symbol, the 
ofRcial/preferred gene name, ail gene nidcnames and all syntax 
variants. In situations vi^here there are several members of a 
gene family or splice variants, some authors prefer to use a 
shortened gene family name* e.g.. estrogen receptor instead of 
estrogen receptor 1 (£5/?/). creating a source of false negatives. 
For this reason^ gene family stem terms were created for all 
genes that have an alpha or numerical sufiix (e.g.. IL2RA TGFp, 
ESRl, etc.) and then used to search the literature. The family 
stem terms were handled separately from the specific gene 
names so that it would be clear when linltages were made to 
the gene family versus a $i)ecific member in that family. 

To Improve performance and accuracy, some pre-selection 
was applied to the records that were scanned. First, review 
articles were eliminated to avoid redundant treatment of 
citations. Second, non-English journals were removed because 
the natural language filters were only relevant to English* 
publications. Finally. Journals unlikely to contain primaiy data 
about gene'disease relationships were also removed (e.g., int 
I Health Edua, Bedside Nurse, and / Heaidi Econ.), Together, 
these filteis reduced the 12 198 221 Medline publications Ouly 
2002) by 37%. 

Ranking the Relative Strengths of Gene-Disease Associa- 
tions. In total, there were 618 708 gene-disease co-citations» 
in which 16% (8297) of all studied genes had been associated 
to a disease and 96% (3875) of all diseases had been associated 
to at least one gene. To rank the relative strengths of gene 
disease relationships, we tested several different statistical 
metliods and examined the results. With the exception of the 
relative risk estimates, the methods provided similar results 
with respect to the rank order of the gene-disease association 
strengths. However, after comparing the results to other 
databases and after consulting disease experts, the log of the 
product of frequency (LP?) was selected for further analysis 
because it gave the best results overall. 

Validation of MedGene. In developing this tool, it was 
important to minimize the number of missed genes (false 
negatives) and miscalled genes (false positives). However, in 
situations when these goals were in conflict, inclusiveness was 
prioritized. To determine the false negative rate in MedGene. 
breast cancer was used as a test case because it was associated 
with more genes than any other human disease and because 




Figure 1. Estimation of the fat^,e negative rate by comparison 
with hand<urated databases, fhe breast cancer-related genes 
identified by MedGene were compared with those listed in 
several other databases including the Tumor Gene Database 
(TGD).2 the Breast Cancer Gene Oatabase(BCG).^ GeneCards 
(GC)^^ and Swissprot.^® Genes were considered false negatives 
if they were represented (n at least one of these other databases 
and not in MedGene and their link to breast cancer was sup- 
ported by at least one literature reference. All literature references 
were verified by manual review to confirm their validity. The 
number of genes in each database or shared by more than one 
database is indicated. The false negative rate was calculated by 
genes missed at MedGene (26)/total number of nonoverlapping 
genes in other databases (285). 

there were several public databases that link genes to breast 
cancer. We compared the list of breast cancer- related genes 
from MedGene to these databases, illustrated in Figure 1. 
Among the 285 distinct breast cancer-related genes that were 
supported by at least one literature citation in these hand- 
curated databases, 26 were absent from MedGene, suggesting 
a false negative rate of approximately 9%. To determine why 
ihese were missed, all literature references for these genes (80 
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papers) were reviewed manually (see ihe Supporting Informa- 
tion. Supplemental Table 2, or visit http://hipseq.med. 
harvard.edu/MedGene/publicatlon/s_Table 2.html). Among 
these papers, most false negatives were caused by nonstandard 
gene terms or gene terms eliminated by our specificity Hlters. 
Few genes were missed because they were only mentioned in 
review papers (0.4%) or they appeared only in the body of the 
mariuscript but not die abstract or title (1.1%). Of note. 
MedGene identified approximately 2000 additional breast 
cancer-related genes not listed in any other database. 

To assess the false positive error rate, two complemeritary 
approaches were used: a detailed analysis of one disease and 
a global examination of 1000 diseases. The detailed approach 
examined the false positive error rate and its sources, whereas 
the global approach tested whether the overall results made 
biomedical sense. 

Using the LPF, 1467 genes related to prostate cancer were 
assembled in rank order. We then retrieved approximately 300 
Medline records each for the highest ranlced 100 and the lowest 
ranked 200 genes and manually reviewed the titles and 
abstracts to determine the verity of die association. Nearly 80% 
of the highest ranked 100 genes fell into one of the five 
categories that reflect meaningful gene-disease relationships 
(see the Supporting Information. Supplemental Table 3. or visit 
http://hipseq.med.harvard.edu/MedGene/publication/ 
sJTable Siitml). Among the lowest ranked 200 genes, ap- 
proximately 70% reflected true reladonships. Of the 600 records 
r^ewed. there were only two in which the association between 
the gene and the disease was described as negative. Both were 
genes with very low scores. In both cases, the authors did not 
argue the absence of any relationship, but rather that a 
particular feature of the jgene or protein was not shown to be 
related to human prostate cancer." " 

The coincidence of some gene symbols with medical ab- 
breviations, chemical abbreviations and biological abbrevia- 
tions resulted In most of the false positives (see the Supporting 
Information. Supplemental Table 4, or visit http://hipse- 
q.med.harvard.edu/MedGene/pubiicaUon/s_Table 4.htmD. em- 
phasizing the importance of the filters that were added in the 
seaTx;h algorithm (Table 1), Without the filters, the false positive 
rate more than doubled, and the false negative rate rose 
draniatlcally (data not shown). For example, among die papers 
about breast cancer, there were only 12 Medline records that 
referred to ESRl and 10 to ESR2, whereas almost 2000 papers 
mentioned estrogen receptor widiout specifying ESRJ or ESR2: 
this latter group was detected by the family stem term filter. 

To further validate these results, a global analysis of the gene- 
disease relationships described by MedGene was performed. 
For this experiment, it was reasoned that the more closely 
related the diseases are to one another, the more they will be 
related to the same gene sets. Thus, if the relationships defined 
by MedGene accurately reflected the literature, then an unsu- 
pervised hierarchical clustering of the gene data should group 
diseases in a manner consistent with common medical think- 
ing. Conversely, if the clustered diseases do not make sense 
biologically or medically, it may reflect excessive false positives, 
false negatives, or inappropriate scoring of the data. 

To execute this experiment, the gene sets and the corre- 
sponding LPF values for 1000 randomly selected diseases (each 
with at least 50 gene relationships) were used as a dataset for 
clustering the diseases. A review of the results showed that the 
resulting disease clusters were indeed logical based upon 
common medical knowledge (see the Supporting Information. 
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Supplemental Figure 1, or visit http://hipseq.medharvardedu/ 
MedGene/publication/s^Figure l.html). For example, In one 
such cluster shown in Figure 2, diabetes and its complications 
grouped together and were also closely linked to diseases 
associated with starvation states. 

The number of genes associated with a ^en disease can 
be estimated by adjusting die MedGene number up by the false 
negative rate (-9%) and down by the false positive rate (-^26% 
on average). Using this, the average disease has 103.7 ± 45.3 
(mean ± 5.d.) genes associated with it. aldiough the range is 
quite broad with 2359 genes related to breast cancer. 2122 
genes related to lung cancer and no genes related to a number 
of diseases. 

Applying MedGene to the Ana^is of Large Datasets. Access 
to a comprehensive summary of the genes linked to human 
diseases provided an opportunity to analyze data obtained from 
a high-throughput experiment. We compared the MedGene 
breast cancer gene list to a gene expression data set generated 
from a micro-array analysis comparing breast cancer and 
normal breast tissue samples. Micro-array analysis identified 
2286 genes thai had greater than a l-fold difference in mean 
expression level between breast cancer samples and normal 
breast samples. Using MedGene, we sorted the 2286 genes Into 
four classes: 555 genes directly linked to breast cancer In the 
literature by gene term search (first-degree association by gene 
name): 328 genes directly linked by family term search (first- 
degree association by family term): 1021 genes linked to breast 
cancer only through other breast cancer genes (second-degree 
association): and 505 genes not previously associated with 
breast cancer. (See the Supporting Information. Supplemental 
Figure 2, or visit http://hipseq.med.harvard,edu/MedGene/ 
publication/s^FIgure 2.html.) Among the 505 previously un- 
related genes. 467 were either newly identified genes or genes 
that had not previously been associated widi any disease. 
Among die remaining 38 genes. 9 had been related to other 
cancers, specifically esophageal, colon, uterine, skin, and cervix. 

To determine whether die genes highlighted by die micro- 
array analysis were more likely to have been previously linked 
to breast cancer in the literature, we created a two-dimensional 
plot of the fold change of expression level between breast 
cancer and normal tis5»je versus the literature score (LPF) 
(Figure 3A). There was a b.t)ad spread of expression changes 
among the genes direcUy linked to breast cancer ranging from 
less than' 1 -fold change (68%) to over 40-fold (0.3%). Notably, 
the majority of genes with greater dian 10-fold expression 
changes were linked to breast cancer by first-degree associa- 
tion. 

Among all 754 genes directly linked to breast cancer in the 
literature, there was no correlation between LPF and micro- 
array fold change (r = 0.018. p-value = 0.62). However, when 
we stratified the analysis based on the magnitude of the fold 
change, we observed an increasing trend in correlation (Figure 
3B) suggesting that genes with a more substantial change in 
expression level were more likely to have a stronger association 
in the literature. For genes that had lO-fold change or more in 
expression level, the correlation increased to 0.41 (p-value = 
0,05). 

Wiien we evaluated the micro-array data separately for ER 
positive and ER negative tumors, the trend in correlation 
between fold change and literature score was highly dependent 
on estrogen receptor status. Interestingly, there was a similar 
trend in correlation for ER positive tumors, but no trend in 
correlation for ER negative tumors. 
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Diabetic Neuropatbles 
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Brain Bdema 
Pulmonary Edema 
Nutrition Disorders 
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Figure 2. Global validation by clustering analysis. 2(A). The gene sets and the corresponding LPF values for 1000 diseases, each with 
at least 50 gene relationships, were used in an unsupervised clustering of the diseases based on the gene patterns associated with 
them. A sample of the data is shown here, 2(B). One of the resulting clusters is shown that corresponds to blood sugar states. Diabetes 
terms (above the fine) and siarvalion states terms (under the line) clustered together. Within these groups, there Is also clustering of 
diabetic small vessel complications, altered serum chemistries, nutritional disorders. elc.(Supplemental Figure 1: http://hipseq.med. 
harvard.edu/MedGene/publication/s_Flgure l .html). 

Finally, to validate our findings, we computed similar cor- disease unrelated to breast cancer. As expected, we did not 
relations between ihe breast cancer expression data and observe an increasing trend in correlation for hyperten- 
LPF scores generated by MedGene for hypertension, a sion. 
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Figure 3. Relationship between literature score and functional data for breast cancer. 3A. The data from an expression analysis of 
samples for breast tumors and normal breast tissue were analyzed to Indicate the fold difference of expression level between breast 
tumor and normal sample (cutoff 2: S-fold change). The fold changes were plotted against the literature score for the same gene sel. 
Green dots represent first-degree association by gene search, blue dots represent first-degree association by family search and red 
dots represent no-assoclatlon. Some wel{«slud(ed genes, such as BRCA2 (pink circle), are not reflected by a substantial difference In 
expression level. Furthermore, the majority of genes that have no association with breast cancer in the literature had less than 10-fold 
expression changes (shaded area). 3B. The Spearman rank-correlation coefficients between literature score (LPF) and the fold change 
of expression level t>etween tumor and normal breast samples (y-axis) in relation to the amount of fold change of expression level 
(x-axis). Gene rank lists were generated for breast cancer (blue) and hypertension (pink). Correlations were also computed between 
the breast cancer gene LPF scores and fold change expression data among estrogen receptor positive tumors only (light blue) and 
estrogen receptor negative tumors only (purple). 
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breast neoplasms 


hypertension 


rheumatoid arthritis 


bipolar disorder 


atherosclerosis 




REN 


RA 


ERDAl 


apgi|oprotem 


PGR 


DBF 


TNFR^PlOA 


^NAP2Q 




ERBB2 


LBP 


CRP 


PFKL 


IDLR 


BRCAl 


ACT 


AS 


DRD2 


ELN 


BRCA2 


INS 


ESRl 


TRH 


ARC! 


EGFR 


kaltikrein 


HLA'DRBI 


IMPA2 


APOB 


CYPI9 


ACE 


OR J 


HTR3A 


APOAl 


TFFi 


endothelin 


int^rl^ukln 


DRD3 


MSRi 


PSEN2 


SI00A6 


TNF 


REM 


LPL 


TP53 


BDK 


US 


KCNN3 


PONi 










plasminogen 


CBS3 


DIANPH 


coliagcin 


DRD4 


activator inhibitor 


CBACAM5 


SARJ 


ILiA 


HTR2C 


PLC 












ERBB3 


PIH 


ACR 


RELN 


adhesion molecule 








L/on 


ATOHi 




nLD 






¥ wwr 




rypi ifip 

A I IDC, 




COMT 


INS 






ULO 


MTRPA 


ARG2 




angiotensin 








IHAM 


receptor 


inteneuiun i 


CVAf/f 




matrix 






CCNDI 


ACTR2 


metal toproteir^e 


INPPl 


OLRi 


EOF 


NFPA 


interferon 


NEDD4L 


collagen 


MUCl 


LVM 


CD68 


FRAJ3C 


MCP 








transducer of 




insulin-like 


DBH 


JL4 


ERBB2 


lipoprotein 


BCL2 


NPY 


IL17 


BAIAP3 


AP0A2 










intercellular 


mucin 


POMC 


MMP3 


ATPIB3 


adiieslon molecule 


FGF3 


neuropeptide 


SIL 


DR05 


RAB27A 



• MedGene results for the top 25 genes associated with. breast neoplasms* hypertension, rheumatoid arthritis, bipolar disorder, and atherosclerosis, respectively, 
ranlced by LPP scores. The hyperlink to all the papers co-citing the gene and the disease is available at MedCene website (http://hipseqjnediiarvard.edu/ 

MedCene/). 



Discussion 

The Human Genome Project heralded a new era in biological 
research where the emphasis on understanding specific path- 
ways has expanded to global studies of genomic organi2ation 
and biological systems. High-throughput technologies can 
provide novel insight into comprehensive biological function 
but also introduces new challenges. The utility of these 
technologies is limited to the ability to generate, analyze, and 
interpret large gerie ll^ts. MedCene* a relational database 
derived by mining the information In Medline, was created to 
address this need. MedCene users can query for a rank-ordered 
list of human gene-disease relationships (Table 2) for one or 
more diseases. Each entry is hyperlinked to the original papers 
supporting .each association and to other relevant databases. 

MedQene is an innovative extension of previous text mining 
approaches. Perez-Iratxeta et al. used the GO annotation and 
their chromosomal locations to predict genes that may con- 
tribute to inherited disorders.* MedGene takes a broader view 
and includes all diseases and all possible gene-disease relation- 
ships. Furthermore. MedGene utilizes co-citation to indicate a 
relationship rather than GO annotation, which is limited to the 
subset of genes that have GO annotation. Our approach is 
complementary to that taken by Chaussabel and Sher, who 
used the frequency of co-cited terms to cluster genes into a 
hierarchy of gene-gene relationships.^ 

A unique aspect of this tool is the ability to assess the relative 
strengths of gene-disease relationships based on the frequency 
of both co-citation and single citation. This presupposes that 
most co-citations describe a positive association, often referred 
to as publication bias'* and is supported by our observations 



that negative associations are rare (Supplemental Table 3: 
http://hipseq.med.harvard.edu/MedCene/publlcation/s_Ta' 
ble 3.html). Of course, relationships established by frequency 
of co-citation do not necessarily represent a true biological link: 
however, it is strong evidence to support a true relationship. 

Another important feature of MedGene is the implementa- 
Uon of software filters that substantially reduced the error rate. 
We estimate that less than 10% of all associations were missed 
and at least 70% of even the weakest associations were real. 
For this study; all rf the filters that we applied were general 
ones. e.g.. expanding the list of all gene names to address the 
different syntax forms used by different joumals. eliminating 
gene names that correspond to common English words, etc. 
The majority of the remaining search term ambiguities were 
idiosyncratic and diflicult to identify systematically without 
causing a significant rise In false negatives. Alternative ap- 
proaches, such as (he examination of the nearest neighbor 
terms, need to be considered to further reduce the false positive 
rate. 

It is not uncommon to see expression changes in micro- 
array experiments aS small as 2-fold reported in the literature. 
Even when these expression changes are statistically significant, 
it is not always clear if they are biologically meaningful. When 
comparing expression levels of disease to normal tissue, one 
expects an enrichment of known disease-related genes to 
appear in the altered expression group. MedGene provided a 
unique opportunity to test this notion in the context of existing 
knowledge on a novel breast cancer micro-array dataset. For 
genes displaying a 5-foId change or less in tumors compared 
to normal, there was no evidence of a correlation between 
altered gene expression and a known role in the disease. This 
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Table 3. Genes with Large Expression Changes \n ER- but 
Not In ER+ Breast Tumors 



MAMA mvMhAl 

gene symDoi 


fold f^anve fl^4-k 


fold chanoe fGR~) 


KRTHBl 


1.0 


610.8 


BRS3 


1.2 


89.4 


DKKl 


1.2 


69,8 


ZlCl 


1.9 


59.6 


TLRI 


1.0 


38.5 


KIM0680 


2.6 


33.2 


CDKN3 


1.0 


30.6 


EBIZ 


4.0 


27.9 


CZMB 


3.8 


21.9 


STK18 


4.7 


18.6 


CPR49 


1.0 


14.6 


MYOlO 


1.6 


14.4 


LADl 


-1.0 


13.5 


P0LE2 


4.2 


13.0 


HMG4 


4.4 


12.9 


BCL2UI 


-1.2 


12.3 


LRP8 


2.9 


12.2 


CCNB2 


1.0 


11.8 


CCNE2 


4.0 


11.6 


FOB 


-4.3 


11.1 


KNSL6 


2.9 


10.9 


H1F5 


3.0 


10.2 


SERPINH2 


4.6 


10.2 


YAPI 


1.0 


to.o 


LPHB 


-1.3 


-10.4 


TCEA2 


-l.l 


-10.8 


TFFl 


1.3 


-11,4 


COU7A1 


-4.1 


-15.7 


POPS 


1.1 


-16.2 


BPACI 


-4.6 


-22.3 


PDZKJ 


-l.l 


-36.8 


VEGFC 


-2.8 


-51.5 


Kfuce ' 


-1.4 


-64.9 


SERPmAS 


-1.0 


r83.l 


MEISl 


-1.6 


-85.9 


CAI2 


2.4 


-150.3 



Table 3. MedGcne idenUHed 9 jcl of relatively understudied, yet highly 
expressed genes in ER negative, iHit not ER positive breast tumors. All of 
these genes have either never been co-clted with breast cancer or have a 
weak association except those marlied with an *. 



reflects the many genes whose role in breast cancer may not 
involve large changes in expression in sporadic tumors (e.g.. 
BRCAl and BRCA2^ and genes whose modest changes in 
expression may be unrelated to the disea^^^ Strikingly, among ; 
genes with a 10-fold change or more in expression level, there 
was a strong and significant correlation between expression 
level and a published role in the disease, providing the first 
global validation of the micro-array approach to identifying 
disease-specific genes. 

The results derived from MedGene have two implications. 
First, a careful hunt for corroborating evidence of a role in 
breast cancer should precede any further study of genes with 
less than 5-foId expression level changes. Second, any genes 
with 10-fold changes or more are likely to be related to breast 
cancer and warrant attention. It is likely that this threshold will 
change depending on the disease as well as the experiment. 

Interestingly, the observed correlation was only found among 
ER-positive tumors, not ER-negative. This may reflect a bias 
in the literature to study the more prevalent type of tumor in 
the population. Furthermore, this emphasizes that caution 
must be taken when interpreting exjxirinients that may contain 
subpopulations that behave very differently. The MedGene 
approach identified a set of relatively understudied, yet highly 
expressed genes in ER-negative tumors that are worthy of 
further examination (Table 3). 
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In conclusion, we have developed an automated method of 
summarizing and organizing the vast biomedical literature. To 
our knowledge, the resulting database Is the most comprehen- 
sive and accurate of its kind. By generating a score that reflects 
the strength of the association, it provides an important tool 
for the rapid and flexible analysis of large datasets from various 
high-throughput screening experiments- Furthermore, it can 
be used for selecting subsets of genes for functional studies, 
for building disease-specific arrays, for looking at genes com- 
mon to multiple diseases and various other high-throughput 
applications. In the future, it will be possible to enhance the 
utility of the MedGene database by building links between 
genes and other MeSH terms as well as other biological 
processes and concepts, such as cell division and responses to 
small molecules. 
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of the results, showing that the resulting disease clusters were 
indeed logical (Supplemental Figure 1); and a review of the 
results showing that among the 505 previously unrelated genes. 
467 were either newly identified genes or genes that had not 
previously been associated with any disease (Supplemental 
Figure 2). This material is available free of charge via the 
Internet at http://pubs.acs.org and at the web sites mentioned 
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